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flTT T aPPr ° aCh ' 0r m ° nil0rin9 P roteln "P™tein interactions within 
mac, eukaryottc cells, in which enzymatic B-gal activity serves to monitor the 
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BACKGROUND OF THE INVENTION 

The present invention concerns a method for selecting a 
molecule, and kit thereof, a method for screening a molecule, 
and kit thereof, and a signal amplification system comprising 
a bacterial multi-hybrid system. 

The present invention relates to a signal amplification 
system comprising a bacterial multi-hybrid system, and more 
preferably a two-hybrid system, of at least two chimeric 
polypeptides containing a first chimeric polypeptide 
corresponding to a first fragment of an enzyme and a second 
chimeric polypeptide corresponding to a second fragment of an 
enzyme or a modulating substance capable of activating said 
enzyme, wherein the first fragment is fused to a molecule of 
interest and the second fragment or the modulating substance 
is fused to a target ligand, and wherein the activity of the 
enzyme is restored by the interaction between the said 
molecule of interest and the said target ligand, and wherein 
a signal amplification is generated. 

The present invention also relates to a method of 
selecting a molecule of interest, which is capable of binding 
to a target ligand, wherein the interaction between the said 
molecule of interest and the said target ligand is detected 
25 with a signal amplification system according to the 
invention, by means of generating a signal amplification and 
triggering transcriptional activation. 

The present invention also relates to a method of 
screening for a substance capable of stimulating or 
inhibiting the interaction between a target ligand and a 
molecule of interest, wherein respectively the stimulating or 
the inhibiting activity is detected with a signal 
amplification system according to the invention, by means of 
generating a signal amplification and respectively of 
triggering or of abolishing transcriptional activation, and 
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wherein said signal amplification and said triggered or 
abolished transcriptional activation are compared with those 
obtained from an identical signal amplification system 
without any substance. 

Most biological processes involve specific protein- 
protein interactions. General methodologies to identify 
interacting proteins or to study these interactions have been 
extensively developed. Among them, the yeast two-hybrid 
system currently represents the most powerful in vivo 
approach to screen for polypeptides that could bind to a 
given target protein. Originally developed by Fields and 
coworkers [Fields, S. * Song, o. (1989) Mature 340, 245-6; 
Chien, c. T., Bartel, P. L . , Sternglanz, R. & Fields, S 
(1991) Proc. Natl. Acad. Sci . USA. 88, 9578-82. Two American 
Patents n° 5,283,173 granted on February 1, 1994 (Fields, S. 
& Song, o.) and n° 5,4 68,614 granted on November 21,1995 
(Fields, S. & song, o.) are also incorporated by reference], 
it utilizes hybrid genes to detect protein-protein 
interactions by means of direct activation of a reporter-gene 
expression (Allen, J. B . , Walberg, M . w., Edwards, M. C. L 
Elledge, S. J. (1995) Trends Biochem. Sci. 20, 511-6; Transy, 
C & Legrain, P. (19 95 ) Mol. Biol. Rep. 21, 119-27). 

In essence, the two putative protein partners are 
genetically fused to the DNA-binding domain of a 
transcription factor and to a transcriptional activation 
domain, respectively. A productive interaction between the 
two proteins of interest will bring the transcriptional 
activation domain in the proximity of the DNA-binding domain 
and will trigger directly the transcription of an adjacent 
reporter gene (usually lacZ or a nutritional marker) giving 
a screenable phenotype. As there is evidence that the 
transcription can be activated through the use of two 
functional domains of a transcription factor: a domain that 
recognizes and binds to a specific site on the DNA and a 
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domain that is necessary tor accivation , as re 

tv* 1 - (i986) s ™ ^ ^-^ Ma and Pt . shn . 

(1987) CeiJ 48, 847-853. 



A. £ 



Recently, Rossi « al . (Rossi , F ., charlton> c 
Blau, H. „. (1997) Proc . WatJ ^ ^ g 

descried a different approach, a mammalian "two-hybrid" 
astern, which uses P-galactosidase complementation (UUnann 
A., Jacob, F . S Monod, j. (1 968) J. Mol . sioJ . 32 _ ' 
-niter protein-protein interactions in intact eu.aryotic 



Phage display (Smith, G P fiaoM c „,- 0 „ 0 

uw, r . (iyoi) Science 228, 1315-7- 

^ I' ' SmUh ' G ' P - (1990) scien " 2 < 9 ' 386 - 90 ' 

T l ZT g assay (Germino ' F - J - Wang < z - *• « - 

Item ^ ^ ^ ^ 9 °' "P™" 

alternative approaches to screen complex libraries of 

P«ot«„. for direct interaction with a given l igan d. However, 
these techniques do not allow an in vivo selection of the 
relevant clones. 

Another approech is described in the International 
Patent Application n° WO 96/40987 <Scha tZ ,P. J e t al ) 
which provides random peptide libraries and methods for 
generating and screening libraries to identify peptides that 
bind to receptor molecules of interest, induding antibodies. 
The peptide library is constructed so that the DNA binding 
protean-random peptide fusion product can bind to the 
recount DNA expression vector that encodes the fusion 
Product that contains the peptide of interest. The method of 
aerating the peptide library comprises the steps of ,., 
constructing a recombinant DNA vector that encodes a DNA 
bindm, protein and contains binding sites for the DNA 

22 r° tSin; <W inSertin ' int ° ChS C ° din * »0-nc. of 
tne DNA binding protein in a multiplicity of vectors of step 

C ° dlnq se< 5uences for random peptides such that the 
resulting vectors encode different fusion proteins, each of 
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which is composed of the DNA binding protein and a random 
peptide; (c) transforming host cells with the vectors of 
steps (b); and (d) culturing the host cells transformed in 
step (C ) under conditions suitable for expression of the 
fusion proteins. Typically, a random peptide library will 
contain at least 10« to 10* different members, although 
library si 2es of 10° to io» can be achieved. 

A novel variety of approach is defined in the 
International Patent Application n° WO 96/29429 (Wickens, M. 
& Fields, S.) related to a hybrid system to detect protein- 
RNA interactions using the same method of achievement as 
recited in the two above-mentioned American patents. This 
hybrid system has a first hybrid protein comprising a DNA- 
bmdmg domain and a first RNA-binding domain, a second 
hybrid protein comprising a transcriptional activation domain 
and a second RNA-binding domain, and a hybrid RNA. The 
interaction between both the first RNA-binding domain and the 
hybrid RNA and the second RNA-binding domain and the hybrid 
RNA causes the transcriptional activation domain to activate 
transcription of the detectable gene. 

Bartel, P. L . , Roecklein, J. a., SenGupta, D. & Fields, 
S- (1996) Nat. Genet. 12, 72-77 extended the approach of the 
typical two-hybrid system consisting in a known protein that 
forms a part of a DNA-binding domain hybrid, assayed against 
a library of all possible proteins present as transcriptional 
activation domain hybrids, using the genome of the 
bacteriophage T7, such that a second library of all possible 
proteins is fused to the DNA-binding domain to be analyzed. 
This genome-wide approach to the two-hybrid searches has 
identified 25 interactions among the proteins of T7 . 

SUMMARY OF THE INVENTION 

The aim of the present invention is to provide a novel 
bacterial multi-hybrid system, and more preferably a two- 
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hybrid system, in which proteins of interest are genetically 
fused to two complementary fragments of a catalytic domain of 
an enzyme, which provides significant advantages over the 
prior art. 

Thus, the present invention provides a signal 
amplification system comprising a bacterial multi-hybrid 
system, and more preferably a two-hybrid system, of at least 
two chimeric polypeptides containing a first chimeric 
polypeptide corresponding to a first fragment of an enzyme 
and a second chimeric polypeptide corresponding to a second 
fragment of an enzyme or a modulating substance capable of 
activating said enzyme, wherein the first fragment is fused 
to a molecule of interest and the second fragment or the 
modulating substance is fused to a target ligand, and wherein 
the activity of the enzyme is restored by the interaction 
between the said molecule of interest and the said target 
ligand, and wherein a signal amplification is generated. 

This system allows an easy in vivo screening and 
selection of functional interactions between the target 
ligand and the molecule of interest. 

A genetic test is based on the reconstitution, in a 
specific enzyme deficient bacteria, of a signal transduction 
pathway that takes advantage of the positive control exerted 
by a signaling molecule. Association of the target ligand and 
25 the molecule of interest results in functional 
complementation between the two chimeric polypeptides and 
leads to the signaling molecule synthesis. The signaling 
molecule then triggers transcriptional activation of 
catabolic operons, of a gene conferring resistance to 
antibiotics, of a gene encoding for a toxin or of a color 
marker, such as a fluorescent marker of the type of the Green 
Fluorescent Protein (GFP) that yields a characteristic 
phenotype. In this genetic test of screening and/or 
selection, the involvement of a signaling cascade offers the 
unique property that association between the chimeric 
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polypeptides can be spatially separated from the 
transcriptional activation readout. This permits a versatile 
design of screening procedures either for ligands that bind 
to a given "bait", as in the classical yeast multi-hybrid 
system, or for molecules or mutations that block a given 
interaction between two proteins of interest. 

Furthermore, because the signal amplification system 
according to the invention involves the generation of at 
least one signaling molecule, also called regulatory 
molecule, the physical association of the two putative 
interacting target ligand and molecule of interest can be 
spatially separated from the transcriptional events that are 
dependent on regulatory molecule synthesis. This means that 
the interaction between a target ligand and a molecule of 
interest under study does not need to take place in the 
vicinity of the transcription machinery as is the case for 
the yeast two-hybrid system as described above. Hence, in 
addition to the methods described above, the present 
invention allows one to analyze more particularly protein 
interactions that occur either in the cytosol or at the inner 
membrane level. 

Another advantage of the present invention over the 
prior art is that this bacterial system is particularly 
versatile as it offers the possibility of both positive and 
negative selections. Positive selection means bacterial 
growth, for example, on minimal medium containing lactose or 
maltose. Negative selection means arrest of growth. 

The present invention further relates to a method of 
selecting a molecule of interest, which is capable of binding 
to a target ligand, wherein the interaction between the said 
molecule of interest and the said target ligand is detected 
with a signal amplification system according to the 
invention, by means of generating a signal amplification and 
triggering transcriptional activation. 
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The present invention also relates to a method of 
screening for a substance capable of stimulating or 
inhibiting the interaction between a target ligand and a 
molecule of interest, wherein the stimulating or the 
5 inhibiting activity is detected with a signal amplification 
system according to the invention, by means of generating a 
signal amplification and triggering transcriptional 
activation; and wherein said signal amplification and said 
triggering transcriptional activation are compared with those 
10 obtained from an identical signal amplification system 
without any substance. 

The present invention also provides a kit for selecting 
a molecule of interest, wherein said kit comprises: 

(a) a signal amplification system according to the 
15 invention; 

(b) an E. coli strain, or a bacterial strain, or an 
eukaryotic cell deficient in endogenous adenylate cyclase; 
and 

(c) a medium allowing the detection of the 
20 complementation selected from the group consisting of 

indicator or selective medium, for example, as minimal medium 
supplemented with lactose or maltose as unique carbon source, 
medium with antibiotics, medium to visualize fluorescence, 
conventional medium, and medium that allows the sorting by 
25 the presence of the phage receptor. 

Further, the present invention also provides a kit for 
selecting a molecule of interest, wherein said kit comprises: 

(a) a signal amplification system according to the 
invention, wherein the molecule of interest is a mutant 

30 molecule compared to the known wild type molecule; 

(b) a signal amplification system according to the 
invention, wherein the molecule of interest is the known wild 
type molecule as the control; 
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(c) E. coli strain, or in any bacterial strain deficient 
in endogenous adenylate cyclase, or any other eukaryotic 
cell; 

(d) a medium allowing the detection of the 
5 complementation selected from the group consisting of 

indicator plate or selective medium as minimal medium 
supplemented with lactose or maltose as unique carbon source 
medium with antibiotics, medium to visualize fluorescence, 
conventional medium, and medium that allows the sorting by 
10 the presence of the phage receptor for each signal 
amplification system; and 

(e) means for detecting whether the signal amplification 
system with the mutant molecule is enhanced or inhibited with 
respect to the signal amplification system with the wild type 

15 molecule. 

The present invention also provides a kit for screening 
for a substance capable of stimulating or inhibiting the 
interaction between a target ligand and a molecule of 
interest, wherein said kit comprises: 
20 < a > a signal amplification system according to the 

invention with the substance capable of stimulating or 
inhibiting the interaction between a target ligand and a 
molecule of interest; 

(b) a signal amplification system according to the 
25 invention without any substance as the control; 

<c) E. coli strain, or in any bacterial strain deficient 
in endogenous adenylate cyclase, or any other eukaryotic cell 
and; 

(d) a medium allowing the detection of the 
complementation selected from the group consisting of 
indicator plate or selective medium as minimal medium 
supplemented with lactose or maltose as unique carbon source, 
medium with antibiotics, medium to visualize fluorescence, 
conventional medium, and medium which allows the sorting by 
the presence of the phage receptor; 
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(e) means for detecting whether the signal amplification 
system with the substance is enhanced or inhibited with 
respect to the signal amplification system without any 
substance. 

According to one embodiment of the present invention, 
the signal amplification system comprises a bacterial multi- 
hybrid system, and more preferably a two-hybrid system, 
containing a first chimeric polypeptide corresponding to a 
first fragment of an enzyme, a second chimeric polypeptide 
corresponding to a second fragment of an enzyme or a 
modulating substance capable of activating said enzyme, and 
a substance capable of stimulating or inhibiting the 
interaction between a target ligand and a molecule of 
interest . 

The present invention also provides a method of 
screening for a substance capable of stimulating or 
inhibiting the interaction between a target ligand and a 
molecule of interest, wherein the stimulating or the 
inhibiting activity is detected with a signal amplification 
system according to the invention, by means of generating a 
signal amplification and triggering transcriptional 
activation, and wherein said signal amplification and said 
triggering transcriptional activation are compared with those 
obtained from an identical signal amplification system 
without any substance. 

BRIEF DESCRIPTION OF THE DRAWINGS 

This invention will be described in greater detail with 
reference to the drawings in which: 

Figures 1A, IB, ic and ID depict the principle of an E. 
coli multi-hybrid system based on functional complementation 
of the catalytic domain of Bordetella adenylate cyclase 
(CyaA) fragments. 
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The upper part schematizes the basic principle of in 
vivo complementation between the two fragments of the 
catalytic domain of B. pertussis adenylate cyclase. The two 
boxes represent the T25 and T18 fragments corresponding to 
amino acids 1 to 224 and 225 to 399 of the CyaA protein. In 
figure 1A, the full-length catalytic domain (residues 1 to 
399), when expressed in E. coli, exhibits a basal calmodulin- 
independent activity that results in cyclic adenosine 
monophosphate (cAMP) synthesis. In figure IB, the two 
fragments T25 and T18, when coexpressed as independent 
polypeptides, are unable to interact and no cAMP synthesis 
occurs. In figure 1C, the two fragments, fused to two 
interacting proteins, X and Y, are brought into close 
proximity resulting in functional complementation, followed 
15 by cAMP production. 

The lower part schematizes the readout of the 
complementation. cAMP, synthesized in an E. coli cya strain 
by the complementing T25 and T18 pairs, binds to the 
catabolite activator protein, CAP. The cAMP/CAP complex (C) 
can then recognize specific promoters and switch on the 
transcription of the corresponding genes. These reporter 
genes can be either natural E. coli genes, such as lacZ or 
mal genes, or synthetic ones, such as antibiotic resistance 
genes fused to a cAMP/CAP dependent promoter. 

Figure 2 is a schematic representation of plasmids . 
The open boxes represent the open reading frames of ft- 
lactamase (bla) and chloramphenicol acetyl transferase (cat) 
genes. The dark boxes correspond to the open reading frame of 
cyaA' with codon numbers indicated below. The hatched boxes 
correspond to the multicloning site sequences (MCS) that are 
fused at the indicated position of the cya open reading 
frame. The origin of replication of the plasmids is indicated 
by dotted boxes. 



20 



25 



WO 99/28746 




PCT/IB98/02085 



Figure 3.1 and Figure 3.2 are schematic representations 
of" other plasmids. 

The left part represents the maps of the plasmids, with 
the different antibiotic-selectable markers (chloramphenicol 
5 acetyl transferase (cat), aminoglycoside phosphotransf erse 
(/can) and Q-lactamase (bla) , the origin of replication and 
the position of the multicloning site sequences (MCS) 
relative to the T25 and T18 open reading frames. The right 
part describes the nucleotide sequence of the multicloning 
0 site sequences (MCS) fused to T25 (Fig. 3.2) or T18 (Fig. 
3.1) and the corresponding reading frames. 

Figures 4A and 4B depict the results of screening of 
interacting proteins with the bacterial two-hybrid system. 

DHPI cells were cotransf ormed with a mixture of plasmids 
5 pT18, pT18-zip, and PT18-Tyr, and either pT25 (Fig. 4 A) or 
pT25-zip (Fig. 4B) , plated on LB-X-Gal agar plates containing 
0.5 mM IPTG, ampicillin and chloramphenicol, and incubated 
for 30 hrs at 30°C. Note that the cya* colonies are larger 
than the cya ones. 
0 Figure 5 relates to the mapping of interacting domains 

of the B. stearothermophllus tyrosyl-tRNA synthetase. 

DNA fragments encoding the indicated polypeptide 
segments of the tyrosyl-tRNA synthetase (the numbers 
correspond to the amino acid residues) were amplified by PCR 
5 using appropriate primers and cloned into pT25 and/or pT18. 
The functional complementation between the indicated chimeric 
proteins was assayed on DHPI cells co- transformed with the 

corresponding plasmids by measuring the P-galactosidase 
activity . 

►0 Figure 6 relates to the mapping of interacting domains 

of B. pertussis B.vgA. 

DNA fragments encoding indicated polypeptide segments of 
BvgA (the numbers correspond to the amino acid residues) were 
amplified by PCR using appropriate primers and cloned into 
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pKT25 and/or pUTT18C. The functional complementation between 
the indicated chimeric proteins was assayed on DHP1 cells co- 
transformed with the corresponding plasmids by measuring the 
P-galactosidase activity. 

5 

DETAILED DESCRIPTION OF THE INVENTION 

Thus, the present invention provides a novel signal 
amplification system in Escherichia coli, in which the 
proteins of interest are genetically fused to two 

10 complementary fragments of the catalytic domain of BordeCeJia 
pertussis adenylate cyclase (Ladant, D. (1988) J. Biol. Chem. 
263, 2612-2618; Ladant, D., Michelson, S., Sarfati, R. S., 
Gilles, A.-M., Predeleanu, R. & Blrzu, 0. (1989) J. Biol. 
Chem. 264, 4015-4020) . 

15 5. pertussis produces a calmodulin dependent adenylate 

cyclase toxin encoded by the cyaA gene (Hewlett, E. L . , 
Urban, M . A., Manclark, C. R. & Wolff, J. (1976) Proc . Natl. 
Acad. Sci. U.S.A. 73, 1926-1930; Glaser, P., Ladant, D., 
Sezer, O., Pichot, F . , Ullmann, A. & Danchin, A. (1988) Mol . 

20 Microbiol. 2, 19-30; Mock, M. & Ullmann, A. (1993) Trends 
Microbiol. 1, 187-192). The catalytic domain is located 
within the first 400 amino acids of this 1706 residue-long 
protein (Ladant, D., Michelson, S., Sarfati, R . S., Gilles, 
A.-M., Predeleanu, R. & Blrzu, O. (1989) J. Biol. Chem. 264, 

25 4015-4020; Glaser, P., Ladant, D., Sezer, 0., Pichot, F. , 
Ullmann, A. & Danchin, A. (1988) Mol . Microbiol. 2, 19-30). 
It exhibits a high catalytic activity (kcat = 2000 s-1) in 
the presence of calmodulin (CaM) , and a low but detectable 
activity (kcat = 2 s-1) in the absence of this activator 

30 (Ladant, D. (1988) J. Biol. Chem. 263, 2612-2618; Wolff, J., 
Cook, G. H., Goldhammer, A. R. & Berkowitz, S. A. (1980) 
Proc. Natl. Acad. Sci. USA. 77, 3841-3844). 

Biochemical studies revealed that the catalytic domain 
can be proteolyt ically cleaved into two complementary 
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fragments, T25 and T18, that remain associated in the 
presence of CaM in a fully active ternary complex (Ladant, D. 

(1988) J. Biol. Chem. 263, 2612-2618; Ladant, D., Michelson, 
S., Sarfati, R. S., Gilles, A.-M., Predeleanu, R. & Blrzu, O. 

(1989) J. Biol. Chem. 264, 4015-4020; Munier, H., Gilles, A. 
M., Glaser, P., Krin, E . , Danchin, A., Sarfati, R. & Barzu, 
0. (1991) Eur. J. Biochem. 196, 469-7/]). in the absence of 
CaM, the mixture of the two fragments did not exhibit 
detectable activity suggesting that the two fragments are not 
able to reassociate to yield basal CaM- independent activity. 

The two complementary fragments, T25 and T18, that are 
both necessary to form an active enzyme, in the presence of 
CaM when expressed in E . coli as separated entities, are 
unable to recognize each other and cannot reconstitute a 
functional enzyme. However, when T25 and T18 are fused to 
peptides or proteins that are able to interact, 
heterodimerization of these chimeric polypeptides results in 
a functional complementation between the adenylate cyclase 
fragments. 

When expressed in an adenylate cyclase deficient E. coli 
strain (E. coli lacks CaM or CaM-related proteins), the T25 
and T18 fragments fused to putative interacting proteins 
reassociate and lead to cAMP synthesis (Figures 1A, IB and 
1C) . 

Interaction between a target ligand and a molecule of 
interest results in functional complementation between the 
two adenylate cyclase fragments leading to cAMP synthesis, 
which in turn can trigger the expression of several resident 
genes. Using this assay, one can select specific clones 
expressing a protein that interacts with a given target by a 
simple genetic screening. 

The present invention provides a signal amplification 
system comprising a bacterial multi-hybrid system, and more 
preferably a two-hybrid system, of at least two chimeric 
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polypeptides containing a first chimeric polypeptide 
corresponding to a first fragment of an enzyme, and a second 
chimeric polypeptide corresponding to a second fragment of an 
enzyme or a modulating substance capable of activating said 
enzyme, wherein the first fragment is fused to a molecule of 
interest and the second fragment or the modulating substance 
is fused to a target ligand, and wherein the activity of the 
enzyme is restored by the interaction between the said 
molecule of interest and the said target ligand, and wherein 
a signal amplification is' generated. 

"Signal amplification system" means a system involving 
the interaction between at least two chimeric polypeptides 
leading to the production of a large number of signaling 
molecules . 

"Signal amplification" means, in the present invention, 
that the number of signaling molecules is higher than the 
number of chimeric polypeptides that produced it. 

The first fragment and the second fragment are issued 
from the same enzyme or not. In any case, the first and the 
second fragments are distinct from each other even if they 
are issued from the same enzyme. For example, the fragments 
are from residues 1 to 224 and 225 to 399 from B. Pertussis 
adenylate cyclase. 

A fragment issued from the enzyme comprises between 20 
25 and 400 amino acid residues and more preferably 200 
consecutive amino acid residues. 

"Modulating substance" refers to a substance capable of 
activating or inhibiting an enzyme, which is an activator, 
natural or not, of the enzyme a fragment thereof, or a 
derivative of the activator; the enzyme having a modulating 
substance-binding site. In a preferred embodiment of the 
invention, the modulating substance is a natural activator 
as, for example, the calmodulin. 

The fragments and the modulating substance are fused, 
respectively, to the molecule of interest or to the target 
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ligand by means of genetic recombination as described herein 
after. A proteolytic cleavage site can be introduced, 
according to the well known techniques, in the genetic 
construction between a fragment of the enzyme and a molecule 
5 of interest in order to eliminate easily, after the 
generation of the signal amplification, by restriction enzyme 
the fragment and to recover the molecule of interest. 

The molecule of interest can be detected for example 
from cDNA, genomic, or synthetic random DNA libraries. 

10 Th e restoration of the enzymatic activity means that an 

enzyme activity is recovered. 

The interaction between the molecule of interest and the 
target ligand means that there exists a recognition which 
could possibly lead to the binding between the molecule of 

15 interest and the target ligand. 

According to the invention, the enzyme is selected from 
the group consisting of adenylate cyclase and guanylate 
cyclase from any origin. Any origin refers to Bordetella 
species or any other organism that produces this type of 

20 enzyme. 

In one specific illustration of the present invention 
the enzyme is the catalytic domain of Bordeteiia adenylate 
cyclase (CyaA) located within the first 400 amino acid 
residues of the adenylate cyclase toxin. 
25 The present invention also concerns a first fragment and 

a second fragment, which are any combination of fragments 
from the same enzyme, which in vitro functionally interact 
with the natural activator of said enzyme by restoring its 
activity. 

30 According to one embodiment of the invention the first 

and the second fragments are selected from the group 
consisting of : 

(a) a fragment T25 corresponding to amino acids 1 to 224 
of CyaA and a fragment T18 corresponding to amino acids 225 

35 to 399 of CyaA; 
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(b) a fragment corresponding to amino acids 1 to 224 of 
CyaA and a fragment corresponding to amino acids 224 to 384 
of CyaA; 

(c) a fragment corresponding to amino acids 1 to 137 of 
CyaA and a fragment corresponding to amino acids 138 to 400 
of CyaA; 

(d) a fragment corresponding to amino acids 1 to 317 of 
CyaA and a fragment corresponding to amino acids 318 to 400 
of CyaA; and 

(e) two fragments from eukaryotic adenylate cyclase in 
association with molecules, such as, G protein and forskolin. 

According to a preferred embodiment of the invention, 
the first and the second fragments are a fragment T25 
corresponding to amino acids 1 to 224 of Bordetella pertussis 
CyaA and a fragment T18 corresponding to amino acids 225 to 
399 of Bordetella pertussis CyaA. 

According to the invention, the modulating substance is 
a natural activator, or a fragment thereof, of the enzyme. In 
a specific embodiment of the invention, the natural activator 
is the calmodulin (CaM) , or a fragment thereof, and said 
first fragment is mutated compared to the wild type enzyme. 
The fragment of calmodulin is about 70 amino acids long, 
corresponding preferentially, to residues 77 to 148 of 
mammalian calmodulin. 
25 According to another aspect, the invention also concerns 

a first fragment, which is a mutated fragment of the 
catalytic domain of Bordetella adenylate cyclase (CyaA) . 
"Mutated fragment" means that it presents at least one 
mutation in the polynucleotide sequence, said fragment in 
30 combination with a second fragment can then in vivo 
functionally interact only in the presence of the natural 
activator . 

The signal amplification system according to the 
invention comprises a bacterial multi-hybrid system 
35 containing: 
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(a) a first chimeric polypeptide corresponding to a 
first fragment of an enzyme; 

(b) a second chimeric polypeptide corresponding to a 
second fragment of an enzyme or a modulating substance 
capable of activating said enzyme; and 

(c) a substance capable of stimulating or inhibiting the 
interaction between a target ligand and a molecule of 
interest, wherein the first fragment is fused to a molecule 
of interest and the second fragment or the modulating 
substance is fused to a target ligand, and wherein the 
activity of the enzyme is restored by the interaction between 
the said molecule of interest and the said target ligand, and 
wherein a signal amplification is generated. 

Another aspect of the present invention consists in a 
method of selecting a molecule of interest, which is capable 
of binding to target ligand, wherein the interaction between 
the said molecule of interest and the said target ligand is 
detected with a signal amplification system according to the 
invention, by means of signal amplification which triggers 
transcriptional activation, and is quantified by measuring 
the synthesis of the signaling molecule or the expression of 
the reporter gene. 

This method of selecting a molecule of interest allows 
selection of a molecule capable of interacting directly with 
25 a predetermined target ligand. 

The signal amplification corresponds to the production 
of a signaling molecule. This signaling molecule is any 
molecule capable of leading to a signaling cascade reaction. 

In a preferred embodiment of the invention, the 
signaling molecule corresponds to the synthesis of cAMP. 

In another preferred embodiment of the invention, the 
signaling molecule corresponds to the synthesis of cGMP. 

The transcriptional activation leads to a reporter gene, 
expression of which is selected from the group consisting of 
gene coding for nutritional marker, such as lactose or 
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maltose; gene conferring resistance to antibiotics such as 
ampicillin, chloramphenicol, kanamycin, or tetracyclic a 
gene encoding for a toxin; a color marker, such as, 
fluorescent marker of the type of the Green Fluorescent 
Protein (GFP) ; a gene encoding phage receptor proteins or a 
fragment thereof, such as phage 8 receptor, Jams, and any 
other gene giving a selectable phenotype. 

According to a preferred embodiment of the invention, 
cAMP, upon binding to CAP, is able to activate the 
transcription of catabolic operons, allowing the bacteria to 
ferment carbohydrates, such as maltose or lactose, and to 
express the phage 8 receptor, protein LamB, which could serve 
as a marker at the bacterial surface. This signal 
amplification system comprising this bacterial multi-hybrid 
system is able to reveal, for example, interactions between 
small peptides (GCN4 leucine zipper), bacterial (tyrosyl tRNA 
synthetase), or eukaryotic proteins (yeast Prpll/Prp21 
complex) . 

Accordingly, specific reporter cassettes in which any 
gene of interest is fused to a cAMP/CAP dependent promoter 
can be designed. Thus, to facilitate the screening and the 
selection of complex libraries, the construction of such a 
simple selection system using an antibiotic resistance gene 
can be performed. 

The reporter gene could be a toxin, not naturally 
present in bacteria, under the control of a cAMP/CAP- 
dependent promoter. This could be particularly useful to 
search for chemical compounds or mutations that abolish a 
given interaction between the target ligand and a molecule of 
interest. According to this construction, when association 
between the target ligand and a molecule of interest takes 
place, cAMP will be produced, the expression of the toxin 
gene will be switched on, and the cells will be killed. A 
substance capable of stimulating or inhibiting the 
interaction between the target ligand and the molecule of 
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interest and that abolishes interaction will shut down toxin 
gene expression and will enable the cells to grow. An easy 
selection for substances that abolish interaction between the 
target ligand and the molecule of interest is resistance to 
Phage 8. The phage receptor, the LamB protein, is the product 
of the lamB gene, which is part of the maltose regulon, 
therefore its expression requires cAMP. In consequence, cells 
producing cAMP will l yse when infected with 8 vir. Substances 
that abolish interaction between the target ligand and the 
molecule of interest will abrogate cAMP synthesis and 
bacteria will become resistant to phage 8 . As a result, the 
cells will grow. 

Another selection scheme for compounds or mutations that 
abolish a given interaction could be designed by constructing 
a strain that harbors a selectable marker (i.e. a gene 
conferring resistance to antibiotics such as ampicillin 
chloramphenicol, kanamycin, tetracyclin, etc.) under the 
transcriptional control of a promoter that is repressed by 
CAMP/CAP. such cAMP/CAP repressed promoter can be engineered 
by introducing a synthetic CAP binding site within the 
promoter region as shown by Morita et al. (Morita T, 
Shigesada K., Kimizuka F., Aiba K. (1988), "Regulatory effect 
of a synthetic CRP recognition sequence placed downstream of 
a promoter," Nucleic Acids Res. 16:7315-32). 

The International Patent Applications n° WO 96/23898 
(Thastrup 0. et al . ) and n° WO 97/11094 (Thastrup O. et al.), 
respectively, relating to a method of detecting biologically 
active substances as Green Fluorescent Protein (GFP) , and the 
International Patent Application n° WO 97/07463 (Chalfie M. 
et al.) describing the uses of GF?, are herein incorporated 
by reference, and a novel variant of GFP. 

In one specific illustration of the present invention, 
the method of selecting a molecule of interest consists in a 
signal amplification system, which comprises a bacterial 
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muiti-hybrid system of at least two distinct fragments o£ an 
enzyme, whos e enzymatic activity is ^stored by the 
interaction between the said molecule of interest and the 
" tar9et li9and ' The <«° fragments are any combination of 
fragments from the enzyme, which in vitro functionaliy 
interact w ith the natural activator of said enzyme by 
restoring its activity. 

According to the method of selecting a molecule of 
interest of the present invention, the fragments are selected 
from the group consisting' of : 

(a) a fragment T25 corresponding to amino acids 1 to 224 
of CyaA and a fragment T18 corresponding to amino acids 225 
to 399 of CyaA; 

(b) a fragment corresponding to amino acids 1 to 224 of 
CyaA and a fragment corresponding to amino acids 224 to 384 
of CyaA; 

(c) a fragment corresponding to amino acids 1 to 137 of 
CyaA and a fragment corresponding to amino acids 138 to 4 00 
of CyaA; 

(d) a fragment corresponding to amino acids 1 to 317 of 
CyaA and a fragment corresponding to amino acids 318 to 400 
of CyaA; and 

(e) two fragments from eukaryotic adenylate cyclase in 
association with molecules, such as G protein and forskolin 

And more particularly, the fragments are a fragment T25 
corresponding to amino acids 1 to 224 of Bordeteila pertussis 
CyaA and a fragment T18 corresponding to amino acids 225 to 
399 of Bordetella pertussis CyaA. 

In another specific illustration of the present 
invention, the method of selecting a molecule of int-rest 
consists in a signal amplification system, which comprises a 
bacterial multi-hybrid system of at least a first fragment of 
an enzyme and a modulating substance, whose activity, which 
is an enzymatic activity, is restored by the interaction 
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between the said molecule „ f lnterest ^ 

ligand. y 

In both of the above illustrations of the present 
invention, the enz yme is seiected from the group consisting 
of adenylate cyclase and guanylate cyclase from any origin 

and more preferably the enzyme is the catalytic domain of 
Bor deteJia adenylate cyclase (CyaA) iocated w . thin ^ f _^ t 

400 amino acid residues of the adenylate cyclase toxin. 

The target ligand according to the invention is selected 
from the group consisting " of protein, peptide, polypeptide, 
receptor, li gan d, antigen, antibody, DMA binding protein 
glycoprotein, lipoprotein and recombinant protein 

"Peptide" or "polypeptide" or "protein" refers to a 
polymer in which the monomers are alpha amino acids joined 
together through amide bonds. Peptides are two or often more 
anuno acid monomers long. Polypeptides are more then ten 
ammo acid residues. Proteins are more than thirty amino acid 
residues. Standard abbreviations for animo acids are used 
herein (see Stryer, 1988, Biochemistry, Third Ed 
incorporated herein by reference). 

" DNA Binding Protein" refers to a protein that 
specifically interacts with deoxyribonucleotide strands A 
sequence specific DNA binding protein binds to a specific 
sequence or family of specific sequences showing a high 
degree of sequence identity with each other (e.g., at least 
about 80 % sequence identity) with at least 100-fold greater 
affinity than to unrelated sequences. The dissociation 
constant of a sequence-specific DNA binding orotein to its 
specific sequence (s) is usually less than about 100 nM, and 
*ay be as low as 10 nM, 1 nM, 1 pM , or IfM. A nonseouence 
specific DNA binding protein binds to a olurality of 
unrelated DNA sequences with a dissociation constant that 
varies by less than 100-fold, usually less than tenfold, to 
tne different sequences. The dissociation constant of a 
nonsequence specific DNA binding protein to the plurality of 
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sequences is usually less than about: l: m . m the present 
invention, DNA binding protein can also refer to an RNA 
binding protein. 

••Recombinant protein" refers to a protein made up of at 
least two separate amino acid chains, which are naturally not 
contiguous. For example, any fusion protein like Lac 
repressor-p-galactosidase, any protein or polypeptide like 
the tyrosyl-tRNA synthetase like leucine zipper derived from 
protein GCN4 . 

According to the method of selecting • a molecule of 
interest of the present invention, the molecule of interest 
is capable of interacting with the target ligand and possibly 
of binding to said target ligand. 

In a specific embodiment of the method of selecting a 
molecule of interest of the present invention, the molecule 
of mterest is a mutant molecule compared to the known wild 
type molecule, and said molecule of interest is tested for 
its capacity of interacting with the target ligand. 

The present invention further relates to a kit for 
selecting a molecule of interest, wherein said kit comprises: 

(a) a signal amplification system according to the 
invention; 

(b) an E. coli strain, or any bacterial strain deficient 
m endogenous adenylate cyclase, or any other eukaryotic 
cell; and 

(c) a medium allowing the detection of the 
complementation selected from the group consisting of 
indicator plate or selective medium as minimal medium 
supplemented with lactose or maltose as unique carbon source, 
or medium with antibiotics, or medium to visualize 
fluorescence, conventional medium, and medium which allows 
sorting by the presence of the phage receptor. The indicator 
Plate is, for example, a MacConkey agar medium supplemented 
with lactose or maltose. 
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A bacterial strain deficient in endogenous adenylate 
cyclase means that this strain is not capable of cAMP 
synthesis . 

The present invention also relates to a kit for 
selecting a molecule of interest, wherein said kit comprises: 

(a) a signal amplification system according to the 
invention, wherein the molecule of interest is a mutant 
molecule compared to the known wild type molecule; 

(b) a signal amplification system according to the 
invention, wherein the molecule of interest is the known wild 
type molecule as the control; 

(c) E. coli strain, or any bacterial strain deficient in 
endogenous adenylate cyclase or any other eukaryotic cell; 
and 

(d) a medium allowing the detection of complementation 
selected from the group consisting of indicator or selective 
medium as minimal medium supplemented with lactose or maltose 
as unique carbon source, medium with antibiotics, medium to 
visualize fluorescence, conventional medium and medium which 
allows the sorting by the presence of the phage receptor for 
each signal amplification system; and 

(e) means for detecting whether the signal amplification 
system with the mutant molecule is enhanced or inhibited with 
respect to the signal amplification system with wild type 
molecule . 

The present invention includes a molecule of interest 
identified by the method of selecting a molecule of interest 
according to the present invention. 

The present invention further includes a molecule of 
interest corresponding to a polynucleotide capable of 
expressing a molecule, which interacts with a fused target 
ligand coupled with an enzyme or a fragment thereof. 

According to another aspect, the invention also concerns 
a method of screening for a substance capable of stimulating 
or inhibiting the interaction between a target ligand and a 
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molecule of interest, wherein the stimulating or the 
inhibiting activity is detected with a signal amplification 
system according to the invention, by means of generating a 
signal amplification and triggering transcriptional 
activation, and wherein said signal amplification and said 
triggering transcriptional activation are compared with those 
obtained from an identical signal amplification system 
without any substance. 

The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
Ugand and a molecule of interest allows the choice of a 
substance acting positively or negatively or even not acting 
in this interaction. 

In the method of screening for a substance capable of 
stimulating the interaction between a target ligand and a 
molecule of interest according to the invention, the signal 
amplification corresponds to the production of a signaling 
molecule and the transcriptional activation leads to a 
reporter gene expression. 

In the method of screening for substance capable of 
inhibiting the interaction between a target ligand and a 
molecule of interest according to the invention, the signal 
amplification corresponding to the production of a signaling 
molecule is blocked or partially abolished and the 
transcriptional activation leading to a reporter gene 
expression is also blocked or partially abolished. 

In one specific illustration of the present invention, 
the method of screening for a substance capable of 
stimulating or inhibiting the interaction between a target 
30 ligand and a molecule of interest, consists in a signal 
amplification system, which comprises a bacterial multi- 
hybrid system of at least two distinct fragments of an 
enzyme, whose enzymatic activity is restored by the 
interaction between the said molecule of interest and the 
said target ligand. The two fragments are any combination of 
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fragments from the enzyme, which in vitro functionally 
interact with the natural activator of said enzyme by 
restoring its activity. 

According to the method of screening for substance 
capable of stimulating or inhibiting the interaction between 
a target ligand and a molecule of interest of the present 
invention, the fragments are selected from the group 
consisting of : 

(a) a fragment T25 corresponding to amino acids 1 to 224 
of CyaA and a fragment T18 corresponding to amino acids 225 
to 399 of CyaA; 

(b) a fragment corresponding to amino acids 1 to 224 of 
CyaA and a fragment corresponding to amino acids 224 to 384 
of CyaA; 

(c) a fragment corresponding to amino acids 1 to 137 of 
CyaA and a fragment corresponding to amino acids 138 to 400 
of CyaA; 

(d) a fragment corresponding to amino acids 1 to 317 of 
CyaA and a fragment corresponding to amino acids 318 to 400 
of CyaA; and 

(e) two fragments from eukaryotic adenylate cyclase in 
association with molecules, such as G protein, and forskolin. 

And more particularly, the fragments are a fragment T25 
corresponding to amino acids 1 to 224 of Bordetella pertussis 
CyaA and a fragment T18 corresponding to amino acids 225 to 
399 of BordeteiJa pertussis CyaA. 

In another specific illustration of the present 
invention, the method of screening for a substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest, consists in a signal 
amplification system, which comprises a bacterial multi- 
hybrid system of at least a first fragment of an enzyme and 
a modulating substance, whose activity, which is an enzymatic 
activity, is restored by the interaction between the said 
molecule of interest and the said target ligand. 
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in both of the above illustrations of the present 
invention, the enzyme is selected from t-h Present 
of , selected from the group consisting 
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, , preferabl y che enzyme is the catalytic domain of 

400 amino acid residues of the adenylate cyclase toxin 
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for r"'" inVenti ° n also P»»i*. a kit for screening 

for a substance capable of stimulating or inhibiting the 
interaction between a target ligand and a molecule of 
interest, wherein said kit comprises: 

(a) a signal amplification system according to the 
invention with the substance capable of stimulating or 
-0 inhibi ing the interaction between a target ligand and a 
molecule of interest; 

<b> a signal amplification system according to the 
invention without any substance as the control; 

(O E. call strain, or in any bacterial strain deficient 
endogenous adenylate cyclase, or any other eukaryotic 
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(d) a medium allowing for the detection of the 
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indicator plate or selective medium as minimal medium 
supplemented with lactose or maltose as unique carbon source 
medium with antibiotics, medium to visualize fluorescence 
conventional medium, and medium that allows the sorting by 
the presence of the phage receptor; and 

(e) means for detecting whether the signal amplification 
system with the substance is enhanced or inhibited with 
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respect to the signal amplification system without any 
substance . 

The present invention includes a substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest identified by the method of 
screening for a substance capable of stimulating or 
inhibiting the interaction between a target ligand and a 
molecule of interest according to the present invention. 

According to a preferred embodiment of the invention, 
the selection and the screening are performed in an E. coli 
strain, or in any bacterial strain deficient in endogenous 
adenylate cyclase, or any other eukaryotic cell. 

Functional analysis of B. pertussis adenylate cyclase 
activity can be easily monitored in an E. coli strain 
deficient in endogenous adenylate cyclase. In E. coli, cAMP 
bound to the transcriptional activator, CAP (catabolite 
activator protein), is a pleiotropic regulator of the 
expression of various genes, including genes involved in the 
catabolism of carbohydrates, such as lactose or maltose 
(Ullmann, A. & Danchin, A. (1983) in Advances in Cyclic 
Nucleotide Research (Raven Press, New York), Vol. vol. 15, 
PP. 1-53). Hence, E. coli strains lacking cAMP are unable to 
ferment lactose or maltose. When the entire catalytic domain 
of CyaA (amino acids 1 to 399) is expressed in E. coli cya 
under the transcriptional and translational control of lacZ 
(plasmid pDIA5240), its calmodulin-independent residual 
activity is sufficient to complement an adenylate cyclase 
deficient strain and to restore its ability to ferment 
lactose or maltose (Ladant, D., Glaser, P. & Ullmann, A. 
(1992) j. Biol. Chem. 267, 2244-2250). This can be scored 
either on indicator plates (i.e. LB-X-Gal or MacConkey media 
supplemented with maltose) or on selective media (minimal 
media supplemented with lactose or maltose as unique carbon 
source) . 
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The fact that the genetic tests according to the 
invention are carried out in E. coli greatly facilitates the 
screening as well as the characterization of the interaction 
between the target ligand and the molecule of interest. 
Firstly, it is possible to use the same plasmid constructs to 
screen a library to identify the molecule of interest, also 
called a putative binding partner, to the target ligand, also 
called a given "bait", and then to express the target ligand 
and the molecule of interest in order to characterize their 
interaction by in vitro binding assays. 

Secondly, the high efficiency of transformation that can 
be achieved in E. coli, allows the analysis of libraries of 
high complexity. This is particularly useful for i) the 
screening and the selection of peptides from a library made 
from random DNA sequences that present an affinity for a 
given bait protein, and ii) the exhaustive analysis of the 
network of interactions between the proteins of a given 
organism (Bartel, P. L . , Roecklein, J. A., SenGupta, D. s 
Fields, S. (1996) Nature Genetics 12, 72-7; Fromont, R. M., 
Rain, J. c. & Legrain, P. (1997) Nature Genetics 16, 277-82). 

The present invention further relates to a 
polynucleotide sequence coding for a signal amplification 
system according to the invention, wherein the polynucleotide 
sequence codes for a bacterial multi-hybrid system of at 
least two chimeric polypeptides containing: 

(a) a first chimeric polypeptide corresponding to a 
first fragment of an enzyme fused to a molecule of interest; 



and 



(b) a second chimeric polypeptide corresponding to a 
second fragment of an enzyme or a modulating substance 
capable of activating said enzyme fused to a target ligand. 

The present invention also relates to a polynucleotide 
sequence coding for the signal amplification system according 
to the invention, wherein the polynucleotide sequence codes 
for a bacterial multi-hybrid system containing: 
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(a) a first chimeric polypeptide corresponding to a 
first fragment of an enzyme fused to a molecule of interest; 

(b) a second chimeric polypeptide corresponding to a 
second fragment of an enzyme or a modulating substance 

5 capable of activating said enzyme fused to a target ligand; 
and 

(c) a substance capable of stimulating or inhibiting the 
interaction between a target ligand and a molecule of 
interest. 

10 This invention will be described in greater detail with 

reference to the following examples. 
Example 1 

DHP1 is an adenylate cyclase deficient (cya) derivative 
of DH1 (F-, glnV44(AS), recAl , endAl , gyrA96 (Nal r ) , thil, 

15 hsdR17, spoTl, rfbDl) (25), and was isolated using 
phosphomycin as a selection antibiotic (Alper, M. D. & Ames, 
B. N. (1978) J. Bacterid. 133, 149-57) . Growth media used 
were the rich medium L3 or the synthetic medium M63 (Miller, 
J- H. (1972) Experiments in molecular genetics (Cold Spring 

20 Harbor Laboratory, Cold Spring Harbor, N.Y.)) supplemented 
with 1 % carbon source. Antibiotic concentrations were as 
follows: ampicillin 100 mg/ml and chloramphenicol 30 mg/ml . 
Screening for the ability to ferment sugars was performed 
either on MacConkey agar plates containing 1 % maltose, or on 

25 LB plates containing 40 mg/ml X-Gal ( 5-Bromo-4-chloro-3- 
indolyl-b-D-galactopyranoside) and 0.5 mM IPTG ( Isopropyl-b- 
D-thiogalactopyranoside) . 
Example 2 

Plasmid pKT25 (3445-bp) is a derivative of the low copy 
30 vector pSU40 (expressing a kanamycin resistance selectable 
marker) that encodes the T25 fragment. It was constructed as 
follows: a 1044 -bp Hindi 1 1 -EcoRI , fragment of pT25 was first 
subcloned into pSU40 linearized with Hindi I I and EcoRI, 
resulting in pKT25L. pKT25 was generated from pKT25L by 
35 deleting a 236-bp Nhel-Hindlll fragment. 
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Plasmid purie (3023 -bp) is a derivative of the high copy 
number vector pUC19 (expressing an ampicillin resistance 
selectable marker and compatible with pT25 or pKT25) that 
encodes the Tie fragment (amino acids 225 to 399 of CyaA) . In 
a first step, we constructed plasmid pUC19L by inserting a 
24 -bp double-stranded oligonucleotide (5.. 

ATTCATCGATATAACTAAGTAA-3 1 [SEQ ID No.: 1] ) and 



its 



complementary sequence) between the EcoRI and A/del sites of 
PUC19. Then, a 534-bp fragment harboring the T18 open reading 
frame was amplified by PCR (using appropriate primers and 
PT18 as target DNA) and cloned into P UC19L digested by ScoRI 
and Clal (the appropriate restriction sites were included 
into the PCR primers). In the resulting plasmid, pUT18, the 
T18 open reading frame is fused in frame downstream of the 
multicloning site of P UC19. This plasmid is designed to 
create chimeric proteins in which a heterologous polypeptide 
is fused to the N- terminal end of T18 (see map) . 

Plasmid pUTiec (3017-bp) is a derivative of pUC19 
(expressing a ampicillin resistance selectable marker and 
compatible with pT25 or pKT25) that encodes the Tie fragment. 
It was constructed by subcloning the same 534-bp PCR- 
amplified fragment harboring the T18 open reading frame 
described above into pUC19L linearized by Hindi I I and PstI 
(the appropriate restriction sites were included into the PCR 
primers) . In the resulting plasmid, pUT18C, the Tie open 
reading frame is fused in frame upstream of the multicloning 
site of pUC19L. This plasmid is designed to create chimeric 
proteins in which a heterologous polypeptide is fused to the 
C-terminal end of T18 (see map) . 

Plasmid pKT25- 2 ip (3556-bp) is a derivative of pKT25 
that was constructed by inserting a DNA fragment "(PCR- 
amplified using appropriate primers) encoding the leucine 
zipper region of GCN4 into pKT25 cleaved by Kpnl , as 
described above . 

Plasmid pUT18- 2 i P (3125-bp) is a derivative of pUT18 
that was constructed by inserting a H4bp DNA fragment (PCR- 
amplified using appropriate primers) encoding the leucine 
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zipper region of GCN4 into pUT18 linearized by Kpnl and 



EcoRI 



Plasmid pUT18c- 2i p (3119-bp) is a derivative of pUTlBC 
that was constructed by inserting the same 114 -bp DNA 
fragment encoding the GCN4 leucine zipper described above 
into pUTl8 linearized by Kpnl and EcoRI . 
Example 3 

Briefly, a cAMP-biotinylated-BSA conjugate was coated on 
EL ISA plates and non-specific protein binding sites were 
blocked with BSA. Boiled bacterial cultures were then added, 
followed by diluted rabbit anti-cAMP antiserum in 50 mM Hepes 
PH 7.5, 150 mM NaCl , o.l % Tween 20 (HBST buffer) containing 
10 mg/ml BSA. After overnight incubation at 4°C, the plates 
were washed extensively with HBST, then goat anti-rabbit IgG 
coupled to alkaline phosphatase (AP) was added and incubated 
for l hr at 30°C. After washing, the AP activity was revealed 
by 5' -para-nitrophenyl phosphate. cAMP concentrations were 
calculated from a standard curve established with known 
concentrations of cAMP diluted in LB medium. 
Example 4 

Two compatible plasmids (derived from pACYC184 and 
pBluescript-II-KS) that express either the T25 fragment 
corresponding to amino acids 1 to 224 of CyaA or the T18 
fragment corresponding to amino acids 225 to 399 were 
constructed. A multicloning site was fused to the C-terminal 
end of T25 to facilitate construction of fusions with foreign 
proteins. Similarly, the T18 fragment was fused in frame to 
alacz of pBluescript-II-KS downstream of its multicloning 
siteK (Figure 2) . 

The two plasmids, pT25 and pT18, were co- trans formed in 
DHP1, a cya derivative of the E. coli strain DH1 (Hanahan, D. 
(1983) J. Mol. Biol. 166, 557-80), and plated on MacConkey 
agar supplemented with maltose. As expected, no spontaneous 
complementation between the two isolated (independently 
expressed) fragments could be detected in vivo: all the 
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transformants were white (see Table 1). When the DHP1 strain 
was transformed with a plasmid expressing the full catalytic 
domain, all colonies were red (Table 1). 

To test whether functional complementation between T25 
and T18 could be brought about by fusing them to interacting 
proteins, there was inserted, within the multicloning site of 
both P T25 and pT18, a DNA sequence that codes for a 35 amino 
acid long leucine zipper derived from protein GCN4, a yeast 
transcriptional activator (Blondel, A. & Bedouelle, H. (1991) 
Protein Engineering 4, 457-61). When the resulting plasmids, 
pT25-zip and pT18-zip, were co- transformed in DHP1 and plated 
on MacConkey/maltose media, the resulting colonies became red 
after 24-30 hours of growth at 30°C (Table 1) . 

Control experiments were carried out in which pT25-zip 
was co-transformed with pT18, or pT18-zip was co- transformed 
with P T25. None of the transformants exhibited 
complementation, demonstrating that the functional 
complementation of T25-zip and T18-zip was mediated by the 
interaction of their leucine zipper motif. The efficiency of 
complementation could be further quantified by measuring in 
liquid cultures, either cAMP levels or p-galactosidase 
activities (Table 1) . 

Adenylate cyclase activities of the different 
transformants were measured in cell extracts in the presence 
of CaM that binds tightly to T25 and T18 fragments to form 
the active adenylate cyclase complex. As shown in Table 1, 
only the extract from DHPl/pT25-zip/pT18-zip exhibited a 
significant enzymatic activity. The lack of activity in the 
extracts of the three other types of transformants indicates 
that, at least one of the two complementary fragments of 
adenylate cyclase was missing, most probably as a consequence 
of its in vivo proteolytic degradation. Therefore, it would 
appear that the association of T25-zip and T18-zip, through 
their leucine zipper motif, not only resulted in their 
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functional complementation, but also in their stabilization. 
Stabilization of protein fragments (a and w peptides) through 
complementation (Ullmann, A., Jacob, F. & Monod, J ♦ (1968) J. 
Mol. Biol. 32, 1-13) has also been observed for 6- 
5 galactosidase . 
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Example 5 

Screening for in vivo protein-protein interactions by 
using functional complementation of T25 and T18 was carried 
out . 

The goal was to examine whether the complementation 
between T25 and T18 could be used to analyze interactions 
between proteins larger than the 35-residue long leucine 
zipper motif. A DNA fragment that encodes the N-terminal part 
(residues 1 to 290) of the dimeric tyrosyl tRNA synthetase 
from Bacillus stearothermophilus (Guez-Ivanier, V. & 
Bedouelle, H. (1996) J. Mol . Biol. 255, 110-120) was 
subcloned into the multicloning site of plasmids pT25 and 
pT18. The resulting plasmids, pT25-TyrRS and pT18-TyrRS, when 
co- trans formed in DHP1, yielded red transf ormants on 
MacConkey/maltose. The transformants synthesized cAMP and 
expressed p-galactosidase (Table 2) . Control transformations 
confirmed that the TyrRS moiety was responsible for the 
functional complementation between T25-TyrRS and Tl 8-TyrRS . 
Furthermore, no complementation occurred when T25-TyrRS was 
cotransformed with pT18-zip or vice versa. This demonstrates 
that the complementation was dictated by the specificity of 
recognition of the polypeptides fused to the two fragments, 
T25 and T18. 

It was further shown (Table 2) that the bacterial multi- 
hybrid system could detect interaction between the yeast 
splicing factors Prpll and Prp21 (fused to T25 and T18, 
respectively) that was previously characterized in the yeast 
two-hybrid assay (Legrain, P. & Chapon, C. (1993) Science 
262, 108-10) . This demonstrates that this bacterial 
complementation assay can reveal association between 
eukaryotic proteins. 
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To mimic a screening procedure, plasmids pT18-zip and 
pT18-TyrRS were mixed with about a 5-fold excess of pT18 and 
co-transformed this mixture in DHP1 with either pT25 or pT25- 
zip. The transformants were plated on LB-X-Gal. All the 
colonies co-transformed with pT25 were white (Fig. Ah) 
Around 20 % of the colonies were blue when the cells were co- 
transformed with the mixture of P T10 derivatives and P T25- 
zip. The plasmid DNAs of these clones were further analyzed 
by restriction mapping. As expected, the blue colonies among 
the bacteria co- transformed with P T25-zip harbored only P T18- 
zip (Fig. 4B) . 

In another series of experiments, pT18-zi D was mixed 
with a 1, 000 fold excess of P T18 and this mixture was 
transformed in DHP1 harboring P T25-zip and plated on 
MacConkey/maltose. Three red colonies were identified among 
about 3,000 white ones. Plasmid DNA analysis of the Mal+ 
clones confirmed the presence of pT18-zip. Transformation of 
the same mixture of pT18-zip/ P T18 into DHP1 harboring P T25 
gave no Mai* clones out of 10,000 analyzed (data not shown). 
These results indicate that the functional complementation 
between the adenylate cyclase fragments could be used to 
identify interacting proteins in E. coli. 

Finally, an examination was made to determine whether 
the complementation between T25 and T18 could be used in a 
selection procedure rather than using the screening described 
above. DHP1 bacteria cotransf ormed with complementing 
plasmids (pT2S-zi P / P TI8-zip or P T25-TyrRS/pT18-TyrRS) were 
able to grow on minimal media supplemented with lactose or 
maltose as unique carbon sources, while bacteria 
cotransformed with non-complementing plasmids ( P T25-zip/pTI 
8-TyrRS or P T25-TyrRS/pTl 8-zip) did not grow. 

To determine whether this selection could be used to 
identify interacting proteins among an excess of non- 
interacting ones, the following "model screening" was 
performed on selective media: DHP1 bacteria harboring pT25- 
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zip and pT18-zip (expected phenotype: Lac') were mixed with 
a 10 5 -excess of DHPl/pT25/pT18 (expected phenotype: Lac), and 
then 10 7 cells from this mixture were plated on minimal media 
supplemented with lactose plus antibiotics. After 4-5 days at 
5 30°C, 100 to 200 Lac* colonies appeared. Plasmid DNA analysis 
indicated that 18 out of 20 of these colonies tested harbored 
pT25-zip and pT18-zip. When 10 7 DHPl/pT25/pT18 cells were 
plated on minimal media/lactose, about 10 colonies were 
detected: these cells appeared to represent spontaneous 
10 revertants of DHP1 to a Lac' phenotype (due to either 
reversion of cya" to cya* or to cAMP/CAP independent lac 
promoter mutations) . This "model screening" demonstrates that 
bacteria expressing specific interacting proteins fused to 
the adenylate cyclase fragments could be selected among a 
15 large number (here a lO'-fold excess) of irrelevant clones. 
Example 6 

A further test was carried out to determine whether 
functional complementation could be obtained when the 
interacting polypeptide is fused at the C-terminus of T18 
rather than at its N-terminus. Two new plasmids derived from 
the pUC19 vector were constructed for this purpose. In pUT18, 
the T18 polypeptide is fused in frame downstream to the 
multicloning site of pUC19, whereas in pUT18C, the T18 
polypeptide is fused in frame upstream of the multicloning 
site. A DNA fragment encoding the leucine zipper of GCN4 was 
then cloned in frame into both pUT18 and pUT18C to yield 
pUT18-zip and pUT18C-zip. 

As shown in Table 3, co-transformation of DHP1 with 
pT25-zip and either pUT18-zip or pUT18C-zip led to functional 
complementation. This indicates that interacting polypeptides 
could be fused at both ends of the T18 fragment with the same 
complementation efficiency. Similar results were obtained 
(Table 3) when the T25-zip chimeric protein was expressed 
from a pSU40 derivative that expresses the kanamycin 
resistance selectable -marker (pKT25-zip) . 
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e_3: Comparison of complementation between N- terminal and 
C-terminal fusion proteins 



Plasmids 


Phenotype on 
MacConkey/maltose 


ft-galactosidase 
units/mg dry weight 
bacteria 


pT25 + pT18 


White / 72 hrs 


154 


pT25-zip + pUT18-zip 


Red / 26 hrs 


5100 


pT25-2ip + pUT18 


White / 72 hrs 


ND 


pT25 + pUT18-zip 


White / 72 hrs 


ND 


pT25-zip + pUT18-2ip 


Red / 26 hrs 


6180 


pT25-2ip + pUT18C 


White / 72 hrs 


ND 


PT25 + pUT18C-zip 


White / 72 hrs 


ND 


pT25-zip + pUT18C-2ip 


Red / 26 hrs 


6100 


pKT25-2ip + pUT18 


White / 72 hrs 


ND 


pKT25 + pUT18-zip 


White / 72 hrs 


ND 


pKT25-zip + pUT18-zip 


Red / 26 hrs 


ND 


pKT25-2ip + pUT18C 


White / 72 hrs 


ND 


pKT25 + pUT18C-zip 


White / 72 hrs 


ND 


pKT25-zip + pUT18C-2ip 


Red / 26 hrs 


ND 



Bacteria were grown in LB at 30°C in the presence of 
appropriate antibiotics. 
ND: not done. 



Example 7 

The bacterial two-hybrid system was used to analy2e 
interactions between various sub-domains of the dimeric 
tyrosyl-tRNA synthetase from B. stearothennophilus , (TyrRS) , 
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which is a symmetrical dimer (Brick, P. and D. M. Blow (1987) 
J Mol. Biol. 194,2 87-297). its monomer is composed of two 
domains . 

Deletion of the C-terminal domain (321-419) of TyrRS 
produces a truncated ATyrRS which activates tyrosine as the 
full-length molecule but is no longer able to bind the 
cognate tRNA. The truncated AtyrRS (1-320) forms a dimer that 
closely resembles the wild-type one. 

The crystal structure of the dimeric ATyrRS revealed 
that each monomer contains two structural domains: an a/p 
domain (1-220) containing six-stranded p-sheets and an ct- 
helical domain (221-320) containing five helices (Brick, P. 
and D. M. Blow 1987) J. Mol . Biol. 194, 287-297). The dimer 
is formed by the association of a hydrophobic surface 
encompassing residues 128-167 within the ct/p domain of each 
subunit . 

To analyze interactions between various sub-domains, 
different fragments of the TyrRS polypeptide (generated by 
PCR using appropriate primers) were fused in frame with 
either the T25 or the T18 fragment and the resulting chimeric 
proteins were tested for functional complementation in DPH1 . 
The results, summarized in Fig. 5, revealed 3 different types 
of interactions between the TyrRS monomers or between the 
TyrRS sub-domains: 

1) Dimerisation through the ct/p domains as can be seen 
in the crystal structure of TyrRS. For instance, the chimeric 
protein T18-TyrRSl-249, which harbors only the a/p domain, 
can fully complement T25-TyrRSl-333 , which contains both the 
a/p and a domains. Previous studies have shown that 
introduction of charged residues into the hydrophobic subunit 
interface of dimeric TyrRS induces reversible dissociation 
(Ward, W.H.J. , H. Jones, and A. R. Ferscht (1987) Biochemistry 
26, 4131-4138) . Confirmation that a point mutation that 
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converts Phel64 to Arg abolishes the interaction between the 
<x/p domains (as shown by the absence of functional 
complementation) has been made. 

2) Dimerisation through the a domains, which has not 
been previously predicted from the crystal structure of 
TyrRS. Analysis of complementation between various fragments 
indicated that this dimerisation is mediated by the C- 
terminal region of the (a domain (Fig. 5)). This region 
contains a pseudo leucine .zipper motif (LLL on Fig. 5) made 
of 3 leucine residues at positions 298, 305, and 312. The 
same segment can also mediate a specific interaction with the 
GCN4 leucine-zipper (Fig. 5) . 

3) Interaction between the a/A domain and the a domain. 
This study illustrates the interest of the bacterial 

two-hybrid in delineating interacting domains of proteins and 
shows that it could reveal interactions, which occur in vivo 
and that were not expected from the three dimensional 
structure . 
Example 8 

The two-hybrid system was also used to analyze the 
dimerization of a DNA-binding protein from B. pertussis, BvgA. 
BvgA is a transcriptional regulator, which in B. pertussis 
controls the expression of virulence-associated genes. It is 
a member of the bacterial two-component signal transduction 
family, together with its cognate sensor protein, BvgS 
(Scarlato et ai . (1990) Proc. Nad. Acad. Sci . USA, 87:6753). 
The transmembrane BvgS is autophosphorylated in response to 
environnemental signals and subsequently phosphorylates BvgA. 
BvgA, in its phosphorylated form, can bind to specific DNA 
sequences within the promoter of several virulence genes and 
activates their transcription. Several studies previously 
suggested that BvgA might have the capacity to dimerize 
although a direct demonstration that BvgA is a dimer is still 
lacking . 
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To study the dimerization of BvgA, a set of plasmids was 
constructed that encode various fragments of the BvgA 
polypeptide fused to either T25 or T18. These plasmids were 
co-transformed in DHP1 (Fig. 6), and the level of functional 
complementation between the different chimeric proteins was 
determined by measuring p-galactosidase activities. These 
results indicate that BvgA can indeed dimerize and that the 
critical region required for dimerization is localized within 
the central part of the protein. 
Example 9 

A selection procedure was also established that will 
permit an easy screening for mutations that abolish the 
interaction between two hybrid proteins. This selection is 
based on the well established fact that E.coli cya* strains 
are resistant to phage X, whereas E.coli cya strains are 
sensitive. The phage receptor, the LamB protein, is the 
product of the lamB gene, which is part of the maltose 
regulon; therefore, its expression requires cAMP. m 
consequence, cells producing cAMP will l yse when infected 
with Xvir. Molecules or mutations that abolish the 
interaction between two hybrid proteins will abrogate cAMP 
synthesis, and, therefore, the cells should become resistant 
to phage X. 

Quantification of resistance to phage Xvir has been 
performed in liquid Luria broth. Experimental conditions of 
infection that enable complete lysis of cAMP producing 
bacteria are the following: 

- multiplicity of infection: 2 to 10, 

- MgS0 4 concentration: 20 mM, 

- incubation time at 37°c under aeration for 2 to 3 
hours . 

DHP1 bacteria were grown overnight in the presence of 1 
mM cAMP in Luria broth. The bacteria were washed 3 times with 
Luria broth, and an a-liquot was immediately infected with 
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Xvir. Another aliquot was diluted and cultivated in the 
absence of cAMP at 37°C. After 15 generations of growth, a 
sample of the bacteria grown in the absence of cAMP has been 
infected with \vir. After serial dilutions, bacteria were 
plated on solid Luria broth and counted. Out of 1.6 x 10° 
bacteria grown in the absence of cAMP 1.2 x 10° phage 
resistant clones were counted (75 %) , whereas out of 1.3 x 
10° bacteria grown in the presence of cAMP, only 140 Jt- 
resistant clones (frequency of 10- 6 ) were found. These latter 
X-resistant clones were white on MacConkey maltose plates, 
suggesting that they were cAMP-independent mal T mutants . 

Plasmids useful for practicing this invention have been 
deposited at Collection Nationale de Cultures de 
Microorganismes in Paris, France on November 25, 1998, as 
15 follows: 

Plasmid Accession No. 

XL-l/pUT18 1-2092 
XL-l/pUT18C 1-2093 
XL-l/pT25 1-2094 
20 XL-l/pKT25 1-2095 

As it appears from the teachings of the specification, 
the invention is not limited in scope to one or several of the 
above detailed embodiments; the present invention also 
embraces all the alternatives that can be performed by one 
skilled in the same technical field, without deviating from 
the subject or from the scope of the instant invention. 
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1. A signal amplification system comprising a bacterial 
multi-hybrid system of at least two chimeric polypeptides 
containing : 

(a) a first chimeric polypeptide corresponding to a 
first fragment of an enzyme; 

(b) a second chimeric polypeptide corresponding to a 
second fragment of an enzyme or a modulating substance 
capable of activating said enzyme, 

wherein the first fragment is fused to a molecule of interest 
and the second fragment or the modulating substance is fused 
to a target ligand and wherein the activity of the enzyme is 
restored by the in vivo interaction between the said molecule 
of interest and the said target ligand and wherein a signal 
amplification is generated. 

2. The signal amplification system according to claim 

1, wherein the enzyme is an enzyme selected from the group 
consisting of adenylate cyclase and guanylate cyclase from 

20 any origin. 

3. The signal amplification system according to claim 

2, wherein the enzyme is the catalytic domain of Bordetella 
adenylate cyclase (CyaA) , located within the first 400 amino 
acid residues of the adenylate cyclase toxin. 

25 4 - Th e signal amplification system according to claim 

3, wherein the first and the second fragments are any 
combination of fragments from the same enzyme which in vitro 
functionally interact with the natural activator of said 
enzyme by restoring its activity. 

30 5 - Th e signal amplification system according to claim 

4, wherein the first and the second fragments are selected 
from the group consisting of : 

(a) a fragment T25 corresponding to amino acids 1 to 224 
of CyaA and a fragment T18 corresponding to amino acids 225 
35 to 399 of CyaA; 
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(b) a fragment corresponding to amino acids 1 to 224 of 
CyaA and a fragment corresponding to amino acids 224 to 384 
of CyaA; 

(c) a fragment corresponding to amino acids 1 to 137 of 
CyaA and a fragment corresponding to amino acids 138 to 400 
of CyaA; 

(d) a fragment corresponding to amino acids 1 to 317 of 
CyaA and a fragment corresponding to amino acids 318 to 400 
of CyaA; 

(e) two fragments from eukaryotic adenylate cyclase in 
association with molecules such as G protein, forskolin. 

6. The signal amplification system according to claim 
4 or 5, wherein the first and the second fragments are a 
fragment T25 corresponding to amino acids 1 to 224 of 
Bordetella pertussis CyaA and a fragment T18 corresponding to 
amino acids 225 to 399 of Bordetella pertussis CyaA. 

7. The signal amplification system according to any one 
of the claims 1 to 3, wherein the modulating substance is a 
natural activator, or a fragment thereof, of the enzyme. 

8. The signal amplification system according to claim 7, 
wherein the natural activator is the calmodulin (CaM) , or a 
fragment thereof, and said first fragment is mutated compared 
to the wild type enzyme. 

9. The signal amplification system according to claim 
25 8, wherein the first fragment is a mutated fragment of the 

catalytic domain of Bordetella adenylate cyclase (CyaA) . 

10. A method of selecting a molecule of interest which 
is capable of binding to target ligand wherein the 
interaction between the said molecule of interest and the 
said target ligand is detected with a signal amplification 
system according to any one of the claims 1 to 9, by means of 
generating a signal amplification and triggering 
transcriptional activation. 
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11. The method of selecting a molecule of interest 
according to claim 10, wherein the signal amplification 
corresponds to the production of a signaling molecule. 

12. The method of selecting a molecule of interest 
according to claim 10, wherein the transcriptional activation 
leads to a reporter gene expression. 

13. The method of selecting a molecule of interest 
according to any one of claims 10 to 12, wherein the signal 
amplification system comprises a bacterial multi-hybrid 
system of at least two distinct fragments of an enzyme, whose 
enzymatic activity is restored by the interaction between the 
said molecule of interest and the said target ligand. 

14. The method of selecting a molecule of interest 
according to any one of claims 10 to 12, wherein the signal 
amplification system comprises bacterial multi-hybrid system 
of at least a first fragment of an enzyme and a modulating 
substance, whose activity is restored by the interaction 
between the said molecule of interest and the said target 
ligand. 

15. The method of selecting a molecule of interest 
according to any one of claims 10 to 14, wherein the target 
ligand is selected from the group consisting of protein, 
peptide, polypeptide, receptor, ligand, antigen, antibody, 
DNA binding protein, glycoprotein, lipoprotein and 
recombinant protein. 

16. The method of selecting a molecule of interest 
according to any one of claims 10 to 15, wherein the molecule 
of interest is capable of interacting with the target ligand 
and possibly of binding to said target ligand. 

17. The method of selecting a molecule of interest 
according to any one of claims 10 to 16, wherein the 
interaction between the molecule of interest and the target 
ligand is detected, by means of signal amplification which 
triggers transcriptional activation, and is quantified by 
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measuring the synthesis of the signaling molecule or the 
expression of the reporter gene. 

18. The method of selecting a molecule of interest 
according to claim 11, wherein the signaling molecule 

5 corresponds to the synthesis of cAMP. 

19. The method of selecting a molecule of interest 
according to claim 11, wherein the signaling molecule 
corresponds to the synthesis of cGMP. 

20. The method of selecting a molecule of interest 
0 according to claim 12, wherein the reporter gene expression 

is selected from the group consisting of gene coding for 
nutritional marker such as lactose, maltose; gene conferring 
resistance to antibiotics such as ampicillin, kanamycin or 
tetracyclic gene encoding for toxin; color marker such as 
5 fluorescent marker of the type of the Green Fluorescent 
Protein (GFP) ; gene encoding for phage receptor proteins or 
fragment thereof such as phage X receptor, lamB and any other 
gene giving a selectable phenotype. 

21. The method of selecting a molecule of interest 
according to any one of claims 10 to 20, wherein the molecule 
of interest is a mutant molecule compared to the known wild 
type molecule and said molecule of interest is tested for its 
capacity of interacting with the target ligand. 

22. The method of selecting a molecule of interest 
according to any one of claims 10 to 21, wherein the 
selection is performed in an E. coli strain, or in any 
bacterial strain deficient in endogenous adenylate cyclase or 
any other eukaryotic cell. 

23. A kit for selecting molecule of interest, wherein 
said kit comprises: 

(a) a signal amplification system according to any one 
of claims 1 to 9; 

(b) an E. coli strain, or in any bacterial strain 
deficient in endogenous adenylate cyclase or any other 

35 eukaryotic cell and; 
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(c) a medium allowing the detection of the 
complementation selected from the group consisting of 
indicator or selective medium as minimal medium supplemented 
with lactose or maltose as unique carbon source, medium with 
antibiotics, medium to visualize fluorescence, conventional 
medium and medium which allows the sorting by the presence of 
the phage receptor. 

24. A kit for selecting molecule of interest, wherein 
said kit comprises: 

(a) a signal amplification system according to any one 
of claims 1 to 9 wherein the molecule of interest is a mutant 
molecule compared to the known wild type molecule; 

(b) a signal amplification system according to any one 
of claims 1 to 9 wherein the molecule of interest is the 
known wild type molecule as the control; 

(c) E. coli strain, or in any bacterial strain deficient 
in endogenous adenylate cyclase or any other eukaryotic cell 
and; 

(d) a medium allowing the detection of the 
complementation selected from the group consisting of 
indicator or selective medium as minimal medium supplemented 
with lactose or maltose as unique carbon source, medium with 
antibiotics, medium to visualize fluorescence, conventional 
medium and medium which allows the sorting by the presence of 
the phage receptor for each signal amplification system; 

(e) means for detecting whether the signal amplification 
system with the mutant molecule is enhanced or inhibited with 
respect to the signal amplification system with wild type. 

25. a method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest wherein respectively the 
stimulating or the inhibiting activity is detected with a 
signal amplification system according to any one of the 
claims 1 to 9, by means of generating an amplification and 
respectively of triggering or of abolishing transcriptional 
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activation, and wherein said signal amplification and said 
triggered or abolished transcriptional activation are 
compared with those obtained from an identical signal 
amplification system without any substance. 

26. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to claim 2S, 
wherein the signal amplification system comprises a bacterial 
multi-hybrid system of at least two distinct fragments of an 
enzyme, whose enzymatic" activity is restored by the 
interaction between the said molecule of interest and the 
said target ligand. 

27. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to claim 25, 
wherein the signal amplification system comprises a bacterial 
multi-hybrid system of at least a first fragment of an enzyme 
and a modulating substance, whose activity is restored by the 
interaction between the said molecule of interest and the 
said target ligand. 

28. The method of screening for substance capable of 
stimulating the interaction between a target ligand and a 
molecule of interest according to any one of claims 25 to 27, 
wherein the signal amplification corresponds to the 
production of a signaling molecule. 

29. The method of screening for substance capable of 
inhibiting the interaction between a target ligand and a 
molecule of interest according to any one of claims 25 to 27, 
wherein the signal amplification corresponding to the 
production of a signaling molecule is blocked or partially 
abolished. 

30. The method of screening for substance capable of 
stimulating the interaction between a target ligand and a 
molecule of interest according to any one of claims 25 to 28, 
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wherein the transcriptional activation leads to a reporter 
gene expression. 

31. The method of screening for substance capable of 
inhibiting the interaction between a target ligand and a 
molecule of interest according to any one of claims 25 to 27 
and to claim 29, wherein the transcriptional activation 
leading to a reporter gene expression is blocked or partially 
abolished . 

32. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to any one of 
claims 25 to 31, wherein the target ligand is selected from 
the group consisting of receptor, ligand, antigen, antibody, 
DNA binding protein, glycoprotein and lipoprotein. 

33. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to any one of 
claims 25 to 32, wherein the substance is selected from the 
group consisting of protein, glycoprotein, lipoprotein, 
ligand and any other drug having stimulating or inhibitory 
affinity. 

34. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to claim 28 or 
29, wherein the signaling molecule corresponds to the 
synthesis of cAMP. 

35. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to claim 28 or 
29, wherein the signaling molecule corresponds to the 
synthesis of cGMP. 

36. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to claim 30 or 
31, wherein the reporter gene expression is selected from the 
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group consisting of gene coding for nutritional marker such 
as lactose, maltose; gene conferring resistance to 
antibiotics such as ampicillin, kanamycin or tetracyclic 
gene encoding for toxin; color marker such as fluorescent 
marker of the type of the Green Fluorescent Protein (GFP) ; 
gene encoding for phage receptor proteins or fragment thereof 
such as phage X receptor, lamB and any other gene giving a 
selectable phenotype. 

37. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to any one of 
claims 25 to 36, wherein the molecule of interest is a mutant 
molecule compared to the known wild type molecule and said 
molecule of interest is tested for its capacity of 
interacting with the target ligand. 

38. The method of screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest according to any one of 
claims 25 to 37, wherein the screening is performed in an E. 
coli strain, or in any bacterial strain deficient in 
endogenous adenylate cyclase or any other eukaryotic cell. 

39. a kit for screening for substance capable of 
stimulating or inhibiting the interaction between a target 
ligand and a molecule of interest, wherein said kit 
comprises : 

(a) a signal amplification system according to any one 
of claims 1 to 9 with the substance capable of stimulating or 
inhibiting the interaction between a target ligand and a 
molecule of interest; 

(b) a signal amplification system according to any one 
of claims 1 to 9 without any substance as the control; 

(c) E. coli strain, or in any bacterial strain deficient 
in endogenous adenylate cyclase or any other eukaryotic cell 
and; 
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(d) a medium allowing the detection of the 
complementation selected from the group consisting of 
indicator plate or selective medium as minimal medium 
supplemented with lactose or maltose as unique carbon source, 
medium with antibiotics, medium to visualize fluorescence, 
conventional medium and medium which allows the sorting by 
the presence of the phage receptor and; 

(e) means for detecting whether the signal amplification 
system with the substance is enhanced or inhibited with 
respect to the signal amplification system without any 
substance . 

40. a molecule of interest identified by the method of 
any one of the claims 10 to 22. 

41. A molecule of interest corresponding to a 
polynucleotide capable of expressing a molecule which 
interacts with a fused target ligand coupled with an enzyme 
or a fragment thereof. 

42. A substance capable of stimulating or inhibiting 
the interaction between a target ligand and a molecule of 
interest identified by the method of any one of the claims 25 
to 38. 

43. The signal amplification system according to any 
one of the claims 1 to 9, wherein the bacterial multi-hybrid 
system contains: 

(a) a first chimeric polypeptide corresponding to a 
first fragment a of an enzyme; 

(b) a second chimeric polypeptide corresponding to a 
second fragment of an enzyme or a modulating substance 
capable of activating said enzyme and; 

(c) a substance capable of stimulating or inhibiting the 
interaction between a target ligand and a molecule of 
interest, wherein the first fragment is fused to a molecule 
of interest and the second fragment or the modulating 
substance is fused to a target ligand and wherein the 
activity of the enzyme is restored by the interaction between 
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the said molecule of interest and the said target ligand and 
wherein a signal amplification is generated. 

44. Polynucleotide sequence coding for the signal 
amplification system according to any one of the claims 1 to 
5 9, wherein the polynucleotide sequence codes for a bacterial 
multi-hybrid system of at least two chimeric polypeptides 
containing : 

(a) a first chimeric polypeptide corresponding to a 
first fragment a of an enzyme fused to a molecule of 

10 interest; 

(b) a second chimeric polypeptide corresponding to a 
second fragment of an enzyme or a modulating substance 
capable of activating said enzyme fused to a target ligand. 

45. Polynucleotide sequence coding for the signal 
amplification system according to any one of the claims 1 to 
9 and to claim 43, wherein the polynucleotide sequence codes 
for a bacterial multi-hybrid system containing: 

(a) a first chimeric polypeptide corresponding to a 
first fragment a of an enzyme fused to a molecule of 

20 interest; 

(b) a second chimeric polypeptide corresponding to a 
second fragment of an enzyme or a modulating substance 
capable of activating said enzyme fused to a target ligand; 

(c) a substance capable of stimulating or inhibiting the 
25 interaction between a target ligand and a molecule of 

interest . 
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A Protein Complementation Assav for Detection of 
, „ „ „ .. Protein- Protein Interactions in vivo 

JoetleN Pcltrt,er.md Stephen W Michnick. Department de hinchnme. Unnerve 
de Montreal. Montreal. Quebec. Canada HIT IU 

,.en a V 3 (?LXc e ,^ S r.K 0l0Sy - lnClUdi I S ,ran « ri P ,10 n- >«n«laiion. and metabolic or 
-omolA^ w i pa "r a " s : Jre ""dialed by non-covalentlv-assoc.ated prote , n 

h™hlv*ucx«si£. " " ° r "V'"'" ' iUCh " "'"""unoprecipnaHon. are 
nigmy successiul. yet present .mportam limitations. Recently, a novel strateBv for 

(wZn^ZT?™'" """""M »™». based on complementation of protem 
defected by "he' TeTSV 01i «<»™"««»>« of ,wo P p rot e ,ns of „S 
Zll!.J f reasserr.6fy.ai a reporter enzyme, reconstituted from us 
™™Su men,ary ' ra S™n« iFig. 1). Here we report the development of ™!, 
acTvuvbC'c'Jrnn? " 1W b « d .°V ec ° ns<,tu ' i ° n o^ihydrofolatereduc^e IDHFrI 

used for ,ilrrr?.o. „ y described here requ.res no additional endogenous factors. „ 
8 Pe ^' c P TO,e,n P«"e'n mteracuons. and can be conveniently 
extended to screening cDNA hbrar.es for protein mteracnons conveniently 
Design considerations: We selected mDHFR as a reporter enzyme because- I i ,t „ 
e" mS" 0 ™" 6 - :> 5 ' n,C,Ural " d fUnC ''° nal '""^--"boS, DHFR 

structural fragments fo.ded as two to^ZZ^^K^^J^X^ 

such a* the leucine zippers i Z) used here < Fig. 1 ) S s ™ ms 

Results: Cotransfomiaiion of E. coliwnh constructs coding for Z-Ff 1 21 and Z-FHI 

"h^""r?oi hc r 1 of dcic ? mg r nzymat,c -<-<y 

*c« ^towt, L ^ , " JWf ' JlldCi SC|CC11VC ue-r-jn.. Soec.ficallv. detenu 

•Jn?? f »»u«raie$ that colony growtn anaer selective pressure is poss. • oniv , n 
hn . " prcSS,n 8 bo i h 'SB™ 11 * of " ^DHFR. The presence of the leCcine z ppc 
born fragments of mDHFR .s essential ipanel II. v S HI), demonstrate IhS 

Ov&7£S 0t d ° mam 2 ''u 3 " abS ° ,Ut " 'or DHFR reasTernblv 

Overexposed fus.on prote.ns or the expected molecular we lg ht were apparent oT 
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SDS-PAGE of crude iysate. 

S E abl lt lY mut a rUV Protein stability can be reduced by changing the side-chain volume 
in the hydrophobic core. We generated mutants of Ff3[ to change the efficiency 01 
reassembly of the mDHFR moiet.es into active enzyme, as tests for specific 
reconstuuiion. Thus, we mutated lie i 14 to Val, Ala. or Gly. Colonies obtained from 
conformation of Z-F(I 2) with Z-F(3:Ilc I UA!a| grew more slowly than those 
cot rans formed w.th Z-F(3J or Z-F[3:Ile I 14Val] (Fig. 2B: inset to panel II is 5 < 
enlarged) No colony growth was delected tn cells cotransformed with Z- 
r|3:Ilel I4(jly|. Th c doubling tim«measured for cells expressing Z-Ffl 21 + Z-FI3I 
Z-Ff 1.2] * Z-F[3:lle! !4Val) and Z-F[l.2] +- Z-Ff 3:Uc 1 1 4 Ala) were 1.6-fold I 9-fold 
and 4 "-fold. h<gher respectively, than the doubling time of E. coli expressing wild- 
type mDHFR. r * 

C 5?5!52 i0n ? : Wc havc dcvci °P cd a protein complementation assav based on 
mDHFR. where a leucine zipper directs the reconstuuiion of DHFR activity Activity 
was detected by an E. coli survival assay which is boih practical and inexpensive 
The m m-o assay described here could be applied to screening cDNA libraries for the 
detection of unknown, specific protein-protein interactions. 

RetererK<$ l!)N Johnson and A. V ar »h a vUy < 1994, P.WaS 22. 10340. Ci C. Byvirorf jnd / Kraut 
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FACTORS AND A^oIda^VM* ^ ^SE^Z^^^ 
BLOOD-BRAIN BARRIER AND RETAINED BIOACTTVrTT PERMEABILITY AT THE 

l.«on ar,d agin, model, and has been proper to num^X H ) """"" ^ ,nim> ' 
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ABSTRACT We present an approach for monitoring pro- 
tein-protein interactions within intact eukaryotic cells, which 
should increase our understanding of the regulatory circuitry 
that controls the proliferation and differentiation of cells and 
how these processes go awry in disease states such as cancer 
Chimeric proteins composed of proteins of interest fused to 
complementing 0-galactosidase (0-gal) deletion mutants permit 
□ novel analysis of protein complexes within cells. In this 
approach, the 0-gal activity resulting from the forced interaction 
ol nonfunctional weakly complementing /i-gal peptides (Aa and 
Ac) serves as a measure of the extent of interaction of the 
non./j-gal portions of the chimeras. To test this application of 
lacZ intracistronic complementation, proteins that form a com- 
plex in the presence or rapamycin were used. These proteins. 
FRAP and FKBP12. were synthesized as fusion proteins with Aa 
and Aa,. respectively. Enzymatic 0-gal activit- served to monitor 
cJ»o,T ial,on 0f the ra Pan>ycin-induced chimeric FRAP/ 
r KBP12 protein complex in a time- and dose-dependent manner 
as assessed by histochemical. biochemical, and fluorescence- 
activaied cell sorting assays. This approach .nay prove to be a 
valuable adjunct to in vitro immunoprecipitation and crosslink- 
ing methods and in vivo yeast two-hybrid and fluorescence energy 
transfer systems. It may also allow a direct assessment of specific 
protein dimerization interactions in a biologically relevant con- 
text, localized in the cell compartments in which they occur, and 
in the milieu of competing proteins. 

Specific interactions between proteins in mammalian cells are 
tnc basis of many essential biological processes. For example 
protein-protein interactions are involved in the assembly of 
enzymes and other protein homodimers and heterodimers that 
play important roles in the regulation of intracellular transport 
pathways, gene expression, receptor-ligand interactions, and 
in the therapeutic or toxic effects of administered drugs To 
increase our understanding of these biological processes sev- 
eral techniques have been developed for examining the inter- 
actions between proteins within cells. Coimmunoprecipiiation 
experiments with antibodies are suggestive of such interactions 
■ n that they allow a determination of the affinity of a given 
protein for another protein, albeit in vitro following cell lysis 
under conditions that cannot determine whether the two 
proteins are present within the same compartment or at the 
concentrations tested ( 1-3). Methods for crosslinking proteins 
within the cell and then cofractionating them by chromatog- 
raphy have also proven useful, although purification, sequenc- 
ing and identification of the crosslinked proteins can be 
difficult when they are present in small quantities 

In addition to such biochemical techniques, the yeast two- 
nybnd system has been extremely useful for detecting and iden- 
tifying protein-protein interactions in vivo (4-6). This system 
takes advantage of the properties of the GAL4 protein of the 
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yeast Saccharomyces cerevisiae, a transcriptional activator re- 
quired tor the expression of genes encoding enzymes involved in 

£d1d^n?£ a T ™ C GAL4 Pr ° ,ein COnsists of two se P ara « 
and identifiable domains, an N-terminal DNA binding domain 

and a C-terminal transcription activation domain. Separate fusion 
proteins, each comprising only one of the two GAL4 domains 
fused to one of two different test polypeptides, interact through 
affinity of the different test polypeptides, bringing two GAL4 
domains into close physical proximity and reconstituting GAL4 
function. A distinct advantage of this approach over biochemical 
approaches ,s that it allows identification of novel protein partners 
at a molecular level. However, the system requires that protein- 
protein interactions occur in the nucleus of a cell leading to 
\ d2, f iva ^ n ° f a reporter « eneand th * detection of 
on T?*k ThUS - thC iS indirect and 15 dependent 

on other cellular functions. Nonetheless, numerous previously 

™^P rotein . intera ««°" have been identified u.ing the yeast 
two-hybrid system. 3 

Fluorescence ratio imaging has also been used to study 
protein interactions in live cells (7). This innovative system has 
yielded several important new findings. However, it is limited 
by the requirement that the fluorescent labels on the inter- 
actmg proteins be sufficiently close to permit efficient energy 
ransfer. Also the labeled proteins need to be introduced info 

hatwiuM,^ a? hi8h concen,rati °<«.' Clearly, a method 
that would allow a direct examination of molecular interactions 
with fewer size constraints, at the site where they occur within 
a eukaryotic cell, would be advantageous 
o*~?* T 3 n ° VeI a PP licati °h of the bacterial lacZ 

fZ Jr, allOW thC direCt dete «ion of protein-protein 

interactions in situ in a range of cell types and species The 

E?£ m' ^ ge " e - ^ ala «™ da * tf-pjTSiTbeS 
Si ^u 5 ? 1 35 3 re P° r,er S e ne to measure transcrip- 
tional activity by histochemical or biochemical assays or bv live 
cell sorting (8-1 1). A property of the lacZ gene, inLcistronic 
complementation, has been known and studied for many vears 
m prokaryotes ( 12-14). but has only recently been adapted fo 

are IZ^T ( ° f inactive deletion mutants 
LsemHfnJ? e com P'«nient.ng one another in trans and 
*T?onllJ° T ? a " * cti rV Myme - Pa i« of deletion mutants 

demified m !« T th 61ther high ° r ,ow efficienc y »™ 
mn« Iff ( 6) ' '? preV,OUS s,udies of myoblast fusion, the 
Z „ [ , C,ent com P] ememi '>g P^r was employed to analyze 

L° Ie ° f Ce " adhesion and si « na ' transduction 
molecules in this poorly understood process (17) 

wet' ulK'nJT' Pa 'L S ° f inaCtive ^8 a ' deletion mutants 
™™ . Produce ch.menc proteins to test whether the 

comp e mented enzyme cou(d serve ^ a marker r the 

protein-protein interactions (see Fig. 1). Several features were 
critical in the application of this approach. Most important was 

Iffin?rl C f« S,,y 1° I 6 ' 6 " 3 ' 8al mUtants wi,h efficiently low 
affinity for each other so that they monitored rather than drove 
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the association of the test proteins Our WochemicaL histo- 
chcmical. and fluorescence-activated cell sorting (FACS) cx 
periments show that appropriate 0-gal m*™™**^ 
porated into fusion proteins that serve as tracers of prote.n 
protein interactions in intact eukaryotic cells. 



MATERIALS AND METHODS 

„r nU i7i ro a p. An and pWZL FKBP12-A<o 
Construction of pWZL rKAr-^a <wu y evnpn _ n£ i 
Viruses. To be able to fuse the sequences coding for FKBIM- and 
I PKRPI2-raDamvcin binding domain in-frame with 0-gal an 

afStol she was inserted in the Ncol sue at he ATG <^hc 
previouslv described ±c and A«, (3-gal mutants (13). Two ,Xhol 
Sail DNA fragments corresponding to am.no ac.ds 2025-2 1 4 ot 
human FRAP and to the complete coding sequence of human 
MPP (kind gifts of Gerald Crabtree. Stanford University 
"of Me^cine) were cloned into th< = ««^ *J 

modified Aa and Ao> mutants, generating FRAP-Aa ana 
FrSpi2 At For both constructs, conservation of the appropn- 
^te readine frame was confirmed by sequencing. 
a,e To £5 the FRAP-A« and FKBPl^^gJ^ ™ 
oWZL-neo and pWZL-hvgro retroviruses (P. J J^ or S ens ^ r ": 

fished work'), an ^^^S^ScSo 
and BamHl sites (GATCACCATGGACGCGTGGATCCM 
was inserted in the ftimHl and Xlw\ sites of the P WZ L^ctot^ 
B^th of the original sites were destroyed by th.s insertion The 
FR A P-la and FKBP 1 2-Aa> coding se que nces we re then * rted 
in the modified pWZL vectors as Ncol-BamW fragments 
'%iruTprod«ction and Infection Protoco.. Proviral < ronstn acts 
were introduced into the Phoenix ccotropic packaging cells (P. L. 
Achacoso and G. P. Nolan, unpublished work) by calcium phos- 
Jhate Section. The media containing retrovirus from the 
SSng cells was harvested 24-72 hr after transfection and 
used to infect C2C12 myoblasts (18) in the presence of 8 ^g/ml 
Sbrene (Sigma). Singlv and doubly infected cells were selected 
^ h the aDPropriatc drugs. Both G418 and hvgromyc.n were used 
I TflLTcZentrationof I mg/ml. The selected cells were 
expanded as populations for subsequent experiments. 

Quantitation of 0-Gal Activity. Unless otherw.se stated rapa- 
mvcin (Calbiochem) was used at a concentration of »0 ng/ml. 
^Biochemical auaLanon of 0*/ «ti. * By 
measured bv chemiluminesccnce as described 15). Briefly 
TeUs cultured in microtiter plates were lysed •„ smi 5°£ of 
a 1:1 mixture of lysis and assay buffers contain.ng Galacmn 
Plus substrate from the Galacto-Light Plus assay kit . (Trop.£ 
Bedford MA). Reactions were terminated after 1 hr a room 
femperature. After addition of Light Emission ^Accee rato , 
solution, luminescence was measured us.ng a M'" oBe,a 1450 
scintillation counter (Wallac. Gaithersburg. MD). 
SC SSL«/ dJrton of fi-gal using fl"™™?*™ 
chemistry {Fluor-X-Gal). Cells were processed « desenbed 
05). Briefly, cells grown on glass coverslips were : fixed in 4 A 
naraformaldehyde in PBS and rinsed twice with PBS. For 
^labeling (actin. 0-gal. and nuclei), feed cells were firs 
stained for actin by incubation in a solution of 1 txM biotin- 
S XX phailoidin (Molecular Probes) for 30 mm at roorr i tem- 
nerature. followed by PBS rinses and incubation n Cy5-labeled 
E™ptavidin (Amersham) (1:250 in PBS. 30 mm. room tem- 
perature). 3-gal was detected by incubation of fixed cells in a 
solution of 25 Mg/ml 5-bromo-6-chloro-3-indolyl fl-D galacto- 
SS* (Fluka) plus 100 ng/ml fast red violet LB (Sigma) 
fn PBS for 45 min at 37°C. Coverslips were rinsed 4 times .with 
PBS nuclei labeled with Hoechst 33258 (Calbiochem; 1 : 1WW0 
in PBS), rinsed again, mounted in PBS, and sea ed to glass 
"ides Triple-labePed deconvolved ^^^SS 
a DeltaVision dcconvolution microscope (APPl«d Precown- 
Mercer Island. WA). Images were represented in false color 
usfng Adobe Photoshop software. Fluor-X-Gal stammgean be 
Sted with either fluorescein or rhodam.ne filters; in this 
case detection was with the rhodamine filter, and is depicted 



^^Nail. Acad. Sci. USA 94 (1997) , 
in green (see Fig. 3 C and D). Cy5 was detected with a Cy5 filter 

cells were photographed using a Zeiss Ax.ophot f uorescence 
microscope P Fluor-X-Gal staining was detected with a rhoda- 
mine f -her and appears red; Hoechst was detected w,th a U V 

fi MS C /ga 8 . exp^in live cells was determined on a FACS 
as dlSnel(10) P exce P t that the fluorescein f^g^" 
anoside substrate was used at a concentration of 1 mM. 

RESULTS 

Desien of Fusion Protein Test System for Monitoring Inter- 
acting Comp^ B-Gal Mutants We have recenUy 
adapted focZ intracistronic complementation for use in eukary- 
odecens (15) For the purpose of testing the potential application 
oTgal complementation as a method to 
protein interactions, a pair of mutants was selected that contained 
The domains postulated to be necessary for trans-complementa- 
fon bm wh chTwere impaired in their ability to restore enzymatic 
Si«y upon coexpression in mammalian cells. These mutants 
w^ e the dassical I donor M15 (hereafter referred to as ^ 
which lacks amino acids 1 1-41 of the wild-type molecule, and an 

folded aructute thai render the „4»m.-i«»tmg 
^^^^^^^^ 

Endear We eaVoned that if complex formation but not ca alyt.c 
"cttitv were Impaired, it should be possible to obta.n efficient 
commentation by promoting the heterodimenzation of weakly 

complex (21-24) The intracellular rapamyc.n binding prote.n. 
complex . ecvop\->\ interacts with the intra- 

FK506-binding protein- 12 (FKBP 1 i). interat » w '" . 
Lllular FKBP-rapamycin associated prote.n (FRAP) only 
when rapamvein isVesent in the culture medium. Th.s . inter 
action is P we.l documented to increase over time am ^^wnh the 
Hose of rapamvein. Rapamycin is a small, cell-permeaoic 
mofecu eX bmds to the two intracellular proteins via 
TncSent determinants. Since rapamycin is > unjbk Mlobjnd 
two FKBP12 molecules at the same time and FRAP only binds 
Spamycin within the FKBP12-rapamyan complex het- 
erodimers do not form unless rapamycin is presen (f^ 
Specifically, the test system involved ^comb n.ng the .weakly 
. complementing 8-gal mutants, la or with the FKBP 'XV 
FRAP/rapamycin system by producing fusion proteins, lwo 
SirtCren retroviral Constructs were designed that encoded 

the e-eal Sa or Au, mutants, respectively (FRAP-iia ana 
FKBPl <-\Z). A prediction of the hypothesis ,s that introduc- 
Sn of these fusio'n proteins into cells would result in JglugJ 
ft oal activity in the absence of rapamycin (Fig. W). Thus 
Ka ac"w tv would first become detectable upon addition of 
drug As a result. «-gal activity would monitor tte w^«'2 
of FRAP and FKBP 12 proteins (green and red proteins and 
cDNAs in Fig. 1 B and C, respectively). 

A critical feature of the design of the system presented I he re 
waV to reduce protein expression levels as much as possible to 
S^pTriSiSte intracellular protein milieu. Accordingly. 
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the fusion proteins should serve as "tracers" of naturally 
occurring protein-protein interactions, overcoming the prob- 
lems previously experienced upon overexpression of intro- 
duced proteins either by transient or stable transfection, or 
transduction with retroviruses. To reduce expression levels, in 
the experiments reported here the cDNAs encoding 
FR AP-Aot and FKBP12--W were not inserted into MFC, but 
into the pWZL retroviral vector (P. J. Morgenstern, unpub- 
lished work), upstream of the gene encoding hygromycin or 
neomycin, respectively. The levels of protein expression are 
reduced in the pWZL vector due to the presence of mutations 
that delete the splice donor/acceptor sequences upstream of 
the ATG of the fusion proteins. These mutations result in a 
lower translation efficiency of the first coding sequence of a 
bicistronic message, but do not affect the translation of the 
second sequence, in this case the selectable marker, which is 
solely dependent on an encephaiomyocarditis virus internal 
ribosomal entry sequence. Using a pWZL vector, 50% less of 
the upstream protein is expressed compared with vectors 
containing wild-type splice donor/acceptor sequences (data 
not shown). As a result of the reduced levels of expression the 
frequency of spontaneous interactions of /3-gal mutants, which 
is concentration dependent, should be significantly reduced. 

Induction of 0-Gal Activity on Compression of FRAP-Acr 
and FKBP12-Aai Fusion Proteins in the Presence of Rapa- 
mycin. The FRAP-Aa-Neo and FKBP12-Ao>-Hygro vectors 
were tested in an established line of myoblasts, C2C12 (18). 
Infectious viral particles were produced by transient transfec- 
tion of each construct into the Phoenix packaging cell line 
(P. L. Achacoso and G. P. Nolan, unpublished work). C2C12 
myoblasts were infected either singly with each retrovirus 
alone or simultaneously with both. All experiments were 
performed after selection with hygromycin and G418 to ensure 
that 100% of the ceils contained the constructs. 

0-gai activity was measured in biochemical assays and found to 
be dependent on the presence of rapamycin in the medium. 
C2C12 cells expressing both fusion proteins were plated in 



Fig. 1. Experimental design. {A) When the Aa and A<*> 
/3-gal mutants are fused to proteins that do not dimerize. 
their association is not favored and /3-gal activity is not 
detected. {8) When the Aa and Aw /3-gal mutants are fused 
to proteins that can dimerize, the formation of active /3-gal 
is favored. (C) Schematic representation of the FKBP12- 
Ao>-Neo and the FRAP-Aa-Hygro constructs. IRES, inter- 
nal ribosome entry sequence; LTR, long terminal repeat. 

replicate in 96-well plates. Rapamycin was added to the culture 
medium, and the /3-gal activity was measured at different time 
points. For each time point, six replicate samples were assayed 
with a sensitive chemiluminescence assay, as described (15)* In 
untreated control samples, no /3-gal activity was detected above 
background. Rapamycin at a concentration of 10 ng/ml induced 
a 30-fold increase in /3-gal activity within 5 hr. After 5 hr, /3-gal 
activity continued to increase, reaching a level 2 orders of 
magnitude above background within 20 hr (Fig. 2A). In control 
populations of cells expressing only one of the two constructs, 
0-gal activity did not increase above background when rapamycin 
was added (data not shown). The linear increase in /3-gal activity 
observed between 5 and 20 hr after rapamycin addition may be 
due to an increase in protein concentration resulting from cell 
proliferation, since the C2C12 cells have a doubling time of 12 hr. 
It is also possible that a donor peptides are stabilized when they 
are incorporated into a multimeric complex. In support of the 
latter possibility, Western blotting of cellular extracts with anti- 
bodies to £-gal revealed an increase in steady-state levels of /3-gal 
mutant peptides upon rapamycin treatment (data not shown); in 
addition, stabilization of complementing /3-gal peptides upon 
assembly of active multimeric enzymes has been reported by 
others (26). 

In Fig. 2B, the dose response curve is shown. j3-gal activity 
increased linearly with the dose of rapamycin in the 0-10 ng/ml 
range. This linearity suggests that 0-gai enzymatic activity can 
serve as a reporter to quantitate protein-protein interactions. 
The results of these experiments demonstrate that the interaction 
of the FKBP12- and FRAP-/3-gal fusion proteins in the presence 
of rapamycin is specific and exhibits a comparable dose-response 
curve to results previously obtained by others for the FKBP12/ 
FRAP/rapamycin complex alone (25). Thus, fusion to /3-gal 
peptides does not interfere with the interaction of the FKBP12 
and FRAP proteins. Moreover, endogenous FKBPJ2 and FRAP 
proteins are ubiquitously expressed and will interact in the 
presence of rapamycin, thereby competing with the introduced 
FRAP-Aa and FKBP12-,Aw fusion proteins, yet not generating 
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Fig. 2. Biochemical assay of induction of /3-gal activity upon 
chimera complementation. (A) Kinetics of induction of /3-gal activity 
upon treatment with rapamycin. Pure populations of C2C12 cells 
stably expressing both FKBF'l2-A<u and FRAP-Aor were plated in 
96-well plates and 10 ng/ml rapamycin was added at time zero. Cells 
were then lysed at different time intervals thereafter, and the /3-gal 
activity in the lysatcs was quantitatcd by chemiluminescence. (B) Dose 
response of /3-gai activity upon rapamycin treatment. C2C12 cells 
expressing both FKBP12-Au> and FRAP-Aa were plated in 96-well 
plates and treated with different concentrations of rapamycin for 3.5 
hr. /3-gal activity is expressed as luminescence counts per second. Each 
point represents the average of six replicate samples. Error bars 
indicate standard deviations from the mean. 

/3-gal activity. Our results indicate that productive FR AP-Aa and 
FKBP12-Aa> dimers will also form, generating /3-gal activity that 
will reflect the interaction of FRAP with FKBPI2-rapamycin in 
that cellular environment even in the presence of the competing 
endogenous proteins. This finding suggests that it will be possible 
to use /3-gal complementation as a tool to analyze protein-protein 
interactions generally. 

)3-gaI activity was assayed using the sensitive Fluor-X-Gal 
histochemical stain, which allows simultaneous tricolor fluores- 
cence analysis, as described (15). Because of its broad emission 
spectrum, the fluorescence of this substrate can be visualized at 
wavelengths that yield either a red (Fig. 3 A and B) or a green 
signal (Fig. 3 C and D). In the absence of rapamycin, only a very 
weak /3-gal fluorescence was observed. For example, in Fig. 2A. 
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in which two fluorochromes were used at low magnification, the 
blue nuclei of the myoblasts was primarily evident with little 
evidence of red /3-gal activity. Similarly, in Fig. 3C in which three 
fluorochromes were used at higher magnification, blue nuclei and 
red phalloidin-staining of actin were evident, but green /3-gal 
activity was barely detectable. In both histochemical assays, in the 
presence of 10 ng/ml rapamycin, the activity of /3-gal increased 
significantly to produce an intense stain that was primarily 
localized in the cytoplasm, as expected (Fig. 3 B and D). Detec- 
tion of the complemented /3-gal enzyme was enhanced by imaging 
with a deconvolution microscope (Fig. 3D). These images show 
that protein complexes forming outside the nucleus are readily 
detectable using this assay. 

The /3-gal activity of a population of cells was assayed in the 
presence and absence of 10 ng/ml rapamycin by FACS. Using this 
sensitive assay, we were able to detect increased /3-gal activity in 
most of the cells after only 90 min of rapamycin treatment (Fig. 
4A). A range of expression levels was seen, as evidenced by the 
breadth of the peak of emission in the presence and absence of 
the drug (Fig. 4A. blue and red profiles). This breadth is pre- 
sumably due to variable efficiency of expression of each of the 
retroviral vectors following integration in the target cell. This 
inference is supported by the finding that when the 25% of the 
cells expressing the lowest /3-gal activity in the absence of rapa- 
mycin were collected (Fig. 45) and then reassayed in the presence 
and absence of rapamycin, a clear distinction between the two 
populations was seen (Fig. 4C). Thus, nonoverlapping popula- 
tions of cells that do or do not express complementing fusion 
proteins can be identified and is >lated by FACS. 

Rapamycin-Dependent Induction of 0-Gal Activity in Cell 
Lysates. To test whether the hexrodimerization of FRAP-Aa 
and FKBP12-Au> induced by rapamycin required cotransla- 
tional assembly or could occur with fully synthesized and 
folded proteins. /3-gal activity was assayed in a cell free system. 
Cells expressing both constructs were grown in the absence of 
rapamycin and lysed in situ in 50 fi\ of a 1 : 1 mixture of lysis and 
assay buffers containing Galacton Plus substrate from the 
Galacto-Light Plus assay kit (Tropix). Rapamycin was then 
added to the lysates, and /3-gal activity was quantitated imme- 
diately. 1 and 3 hr later. As a control, /3-gal activity was 
measured in a parallel set of lysates that were not exposed to 
rapamycin at all. No statistically significant increase in /3-gal 
activity was detected in the samples that did not receive 
rapamycin. By contrast, a more than 2-fold increase in /3-gal 
activity was observed in the rapamycin-treated lysates 1 hr 
after drug administration. The increase in activity detected in 
lysates is only a fraction of the increase observed upon 
rapamycin treatment of intact cells, a finding that very likely 
reflects the lower concentration of FR AP-Aa and FKJ3P12-Aa> 
in the lysates. This is probably due to the combined effects of 
the instability of /3-gal mutants within lysates and of their 
dilution during lysis. Nevertheless, these experiments demon- 
strate that de novo synthesis is not required for complemen- 
tation and that folded proteins can be induced to form 
complexes that can be monitored by /3-gal activity. 

DISCUSSION 
Prerequisites for Monitoring Protein-Protein Interactions by 
0-Gal Complementation. We have shown that /3-gal activity can 
be used to monitor the interaction of chimeric proteins. Critical 
to the success of this system was the choice of two poorly 
complementing /3-gal mutants, since strongly complementing 
mutants spontaneously assemble and produce functional /3-gal 
activity detectable in the absence of any other protein constitu- 
ents (15). By contrast, the weakly interacting Aa or Ao> /3-gal 
mutants expressed from the pWZL vector do not yield detectable 
enzymatic activity, unless their local concentration is increased by 
forcing them to heterodimerize. This is only achieved by synthe- 
sizing them as fusion proteins in which the non-/3-gal portions of 
the chimeras have sufficient affinity to drive the interaction. 
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Fig 3 Hisiochemical assay of induction of 0-gal activity upon chimera complementation. C2C12 cells expressing both FKBP12-Aa> and 
FR A P-Aa were maintained overnight cither in the absence (A and C) or the presence (B and D) of 10 ng/ml rapamycin. 0-gal activity was visualized 
hv fluorescence microscopy using Fluor-X-Gal as substrate. (A and B) Double-labeled samples showing Hoechst stained nuclei (blue) and 0-gal 
;i ctivitv using Fluor-X-Gal as substrate viewed with a rhodamine filter set (red). (C and D) Triple-labeled samples obtained by imaging with a 
DcUaVision microscope showing 0-gal activity using Fluor-X-Gal as substrate (green), Hoechst stained nuclei (blue), and Cy^-labeled actin 
filaments (red) to visualize the contour of each cell at higher magnification. 



The molecular basis of the impaired ability of the Aa and Aw 
mutants to associate spontaneously and recreate active 0-gal is 
unclear. Potential insights derive from prokaryotic studies arid the 
recently published crystal structure of intact 0-gal (27). It is well 
known that efficient j3-gal complementation in prokaryotes re- 
quires intermolecular interactions resulting in the sharing of 
domains between a acceptor and w donor mutants (20). More- 
over, the recently published structure of wild-type 0-gal suggests 
that both the a and w domains are involved in a dimerization step 
critical to the assembly of functional enzymes. These domains are 
also involved in essential contacts with the relatively large central 
region of the molecule (27). Aa 0-gal can be efficiently and 
spontaneously complemented in mammalian cells by an a donor 



lacking the central jx portion of the molecule (15). Similarly. Aw 
0-gal can complement spontaneously with an w donor lacking the 
central domain. In contrast, the weakly complementing mutant 
pairs used here each contain a large intact central domain. 
Thus, one possible explanation for the observed lack of sponta- 
neous complementation between the Aw and the Aa peptides is 
that the presence of the #x domain in both mutants may diminish 
their affinity for each other by steric hindrance or by sequestering 
either the a or the w domain in an intramolecular interaction. 
Accordingly, we postulate that the lack of spontaneous assembly 
and generation of 0-gal activity can be overcome by increasing the 
local concentration of Aa and Aw by forced association of these 
two mutants in chimeras, thereby counteracting the potential 
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Fk;. 4. FACS analysis of induced /3-gal activity upon chimera complementation. The red peaks represent the untreated samples and the blue 
peaks represent samples treated with 10 ng/ml rapamycin. (A ) Induction of /3-gal activity in a population of C2C1 2 cells expressing both FKBP12-Aoj 
and FRAP-Au after 90 min of rapamycin treatment. The majority of the cells respond to rapamycin treatment with an increase in /3-gal activity. 
(/?) Subpopulation of cells selected on the basis of low /3-gal activity in uninduced conditions. (C) The same population was maintained overnight 
in the absence (red peak) or in the presence (blue peak) of rapamycin. The induced and uninduced populations yield essentially nonoverlapping 
peaks. The vertical axis represents relative cell number and the horizontal axis represents intensity of /3-gal fluorescence on a logarithmic scale. 



inhibitory effect imposed by the presence of two central domains. 
Testing of this model will require further mutagenesis, activity 
assays, and x-ray crystaliographic analysis of the resulting struc- 
tures. 

Irrespective of the precise molecular nature of the interactions, 
the data presented here provide strong evidence that by engi- 
neering constructs in which domains or proteins of interest drive 
the dimerization of Aa or Aw )3-gal mutants, it will be possible to 
monitor such interactions by measuring the level of /3-gal activity 
following coexpression of these fusion proteins in intact cells. 
Moreover, the present system should theoretically allow detection 
of complexes in subcellular compartments, including the nucleus, 
the cytoplasm, or the membrane. Finally, as shown here, protein 
dimerization can be monitored in the context of the cell in the 
presence of endogenous competing protein partners. 

Potential for the Development of a "Mammalian Two-Hybrid** 
System. The experiments described in this report show that two 
distinct /3-gal mutants that do not readily assemble can be forced 
to interact and yield significantly increased levels of active en- 
zyme. This is achieved by coupling each mutant with one of a pair 
of highly interactive proteins, as chimeras or fusion proteins. In 
this case, we have used FRAP and FKBP12, proteins that only 
interact in the presence of a small molecule, rapamycin. Using this 
tripartite complex, we were able to show that the affinity of the 
non-/3-gal proteins drove the interaction and that the /3-gal 
components generated enzymatic activity as a result, serving to 
monitor the extent of that interaction. The levels of /3-gal activity 
in the presence and absence of forced dimerization were clearly 
separable by both biochemical and FACS assays, suggesting that 
this system could be used to screen for as yet unidentified protein 
partners. The target protein fused to a complementing /3-gal 
mutant (bait) could be stably expressed in a well-characterized 
cell line. Expression libraries containing cDNAs fused to a /3-gal 
deletion mutant could be introduced into these cells with high 
efficiency using retroviral vectors (28). Finally, gene products that 
interact with the bait could be isolated by identifying /3-gal 
positive clones. A potential advantage of this system over systems 
described previously is that the screen could be carried out in any 
cell type with its own particular milieu of competing resident 
proteins. An attractive possibility is that the bait could be targeted 
to a given cellular compartment, with the aim of identifying 
proteins involved in interactions restricted to that specific loca- 
tion. This mammalian "two-hybrid" screen could also be carried 



out in the presence of extracellular signaling molecules, growth 
factors, or differentiation factors, that might alter the potential for 
dimerization of two given proteins in particular cell types. Taken 
together, this approach to the study of protein-protein interac- 
tions should greatly enhance our understanding of the develop- 
ment of diverse cell types and organisms and how that develop- 
ment goes awry. 

We are grateful to Bruce Blakely for constructive comments on the 
manuscript, to Oivin Guicherit and Norris Turner for technical advice, 
and to Gerald Crabtree for useful discussions. F.R. was supported by a 
long-term postdoctoral fellowship from the Human Frontiers in Science 
Program. H.M.B. is the recipient of an National Institutes of Health 
MERIT award. 
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The structural organization of the low molecular 
mass form (43 kDa) of Bordetella pertussis adenylate 
cyclase was dissected taking advantage of the known 
sequence of the bacterial cya gene (Glaser, P., Ladant, 
D., Sezer, O., Pichot, F., Ullmann, A., and Danchin, A. 
(1988) Mol. Microbiol. 2, 19-30) and its low content 
of Trp and Met residues. Cleavage of the 43-kDa pro- 
tein and of its complementary tryptic fragments (T25 
and T18 peptides) with iV-chlorosuccinimide and cyan- 
ogen bromide followed by sodium dodecyl sulfate-poly- 
acrylamide gel analysis of digestion products allowed 
the following conclusions: (i) the catalytically active 
43-kDa form of B. pertussis adenylate cyclase is within 
the first 400 residues of the protein encoded by the cya 
gene. T25 occupies the N- terminal domain of the pro- 
tein (residues 1-235/237). Isolated T25 fragment ex- 
hibits a low but measurable enzymatic activity which 
indicates that it harbors the catalytic site; (ii) T18 
which is the main calmodulin-binding domain, occupies, 
the C- terminal segment of protein (residues 236/238- 
399) and is devoid of catalytic properties; (iii) the two 
complementary peptides T25 and T18 reassociated 
only in the presence of calmodulin, leading to signifi- 
cant recovery of the original activity. These results 
demonstrate that both fragments of the 43-kDa form 
of adenylate cyclase are essential for a high level of 
enzymatic activity. 



Calmodulin-dependent adenylate cyclase is one of the fac- 
tors implicated in the virulence of Bordetella pertussis, the 
causative agent of whooping cough (1). The existence of 
different molecular forms of this bacterial enzyme was re- 
ported by several groups (2-4). However, the size of the 
protein required for "invasiveness" of adenylate cyclase, Le. 
its ability to enter eukaryotic cells, was difficult to ascertain 
in the absence of any structural data on the bacterial enzyme. 
Molecular cloning of B. pertussis adenylate cyclase gene 
showed that the protein is synthesized as a large precursor of 
1706 residues. From genetic evidence it appeared that the 
calmodulin-; stimulated catalytic activity resides in the 450 N- 
terminal amino acids of adenylate cyclase (5). On the other 
hand, biochemical studies showed that the low molecular mass 
form of B. pertussis adenylate cyclase isolated either from 
culture supernatants or from bacterial extracts consists of 

* This work was supported by Grant URA 150 (UA 1129) from the 
Centre National de la Recherche Scientifique. The costs of publica- 
tion of this article were defrayed in part by the payment of page 
charges. This article must therefore be hereby marked "advertise- 
ment" in accordance with 18 U.S.C. Section 1734 solely to indicate 
this fact. 

** To whom correspondence should be addressed. 



three structurally related peptides of 50, 45, and 43 kDa (6). 
Limited proteolysis with trypsin of the adenylate cyclase/ 
calmodulin complex first converted the 50- and 45-kDa pep- 
tides to the 43-kDa form, then the latter was cleaved into two 
complementary fragments of M T = 25,000 and M, =» 18,000 
called T25 and T18, which were still held together by calmo- 
dulin (CaM) 1 in an active, native-like structure (6). Cross- 
linking experiments with azido-CaM suggested that each frag- 
ment interacted with CaM when the adenylate cyclase-CaM 
complex was formed. 

In this paper we further dissected the structural organiza- 
tion of the low molecular weight form of adenylate cyclase by 
cleavage at specific residues with chemical or enzymatic re- 
agents. Our aim was to locate more precisely the CaM -binding 
and the catalytic sites of B. pertussis adenylate cyclase. We 
identified the T25 peptide as the N-terminal domain of ade- 
nylate cyclase which harbors the catalytic site, whereas the 
C-terminal T18 peptide is mainly involved in the binding of 
CaM. 

MATERIALS AND METHODS 

Chemicals— Adenine nucleotides were from Boehringer Mannheim. 
Bovine brain CaM, TPCK-trypsin, and soybean trypsin inhibitor 
were from Sigma. Methyl 4-azidobenzimidate was purchased from 
Pierce Chemical Co. Urea (fluorimetrically pure) was a product of 
Schwartz/Mann. [ w P]H*P0 4 (carrier- free), Icr- W P]ATP (3000 Ci/ 
mmol). ( 3 HJcAMP (40 Ci/mmol) and Na 125 I (1000 Ci/mmol) were 
obtained from the Radiochemical Centre, Amersham (United King- 
dom). VU-8 CaM, made according to Craig et aL (7), was immobilized 
on CNBr- activated Sepharose 4B (a product of Pharmacia LKB 
Biotechnology inc.) as described by Haiech et at. (8). Bovine brain 
CaM was aztdated essentially according to Andreasen et of. (9) as 
described by Zurini et aL (10). Azido-CaM was iodinated by the 
chloramine-T method at room temperature to a specific activity of 
0.02 mot of iodine/mol of azido-CaM. Bz,ATP was synthesized as 
described by Williams and Coleman (11). "PJBzjATP (200 cpro/ 
pmol) was prepared according to Glynn and Chappell (12). Synthetic 
peptide corresponding to residues 235-254 of B. pertussis adenylate 
cyclase (5) was obtained by a solid phase method on chloromethyl- 
substituted polystyrene resin cross-linked by 1% divinyl benzene. The 
residues were added using the symmetrical anhydride coupling 
method (13, 14). Cleavage from the resin and removal of all remaining 
protecting groups were accomplished by a low/high fluorhydric acid 
procedure (15). 

Purification, Iodination, and Assay of Adenylate Cyclase— Adenyl- 
ate cyclase was extracted from bacterial cells (phase I, type strain 

1 The abbreviations used are: CaM, calmodulin; TPCK, L-l-to- 
sylamido-2-phenyIethyl chloromethyl ketone; NCS, W-cblorosucci- 
nimide; CNBr, cyanogen bromide; Bz,ATP, 3'-0-(4-benzoyl)ben- 
zoyladenosine 5' -triphosphate; SDS-PAGE, sodium dodecyl sulfate- 
polyacrylamide gel electrophoresis; EGTA, (ethylenebis(oxyethyl- 
enenitrilo))tetracetic acid. Pza^za* is a synthetic peptide corresponding 
to residues 235-254 of B. pertussis adenylate cyclase. The sequence 
of Pzm-zm is: Arg GIu-Arg-Ile-Asp-Leu-Leu-Trp-Lys-Ile-Ala-Arg-AIa- 
Gly-Ala-Arg-Ser-Ala-Val-Gly. 
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ATCC 9797), with 4 M urea in buffer A (50 mM Tris-HCl, pH 8, 6 
mM MgCI 2> and Triton X-100) as described previously (16) and 
then purified in a single step chromatography on VU-8 CaM-Sepha- 
rose (8). Enzyme was iodinated by the chloramine-T method at room 
temperature to a specific activity of about 0.08-0.25 mol of iodine/ 
mol of protein. Adenylate cyclase activity was measured in 50 mM 
Tris-HCt, pH 8, containing 6 mM MgCl 2 , 0.1 mM CaCl 2 » 0.1 ^M bovine 
brain CaM, 0.5 mg/ml bovine serum albumin, and 2 mM [or- 32 P]ATP 
(5 x 10 5 cpm/assay) (16). One unit of adenylate cyclase activity 
corresponds to 1 pmo\ of cAMP formed in 1 min at 30 *C and pH 8. 

Preparation of a Homogeneous Low Molecular Mass Form (43 kDa) 
of B. pertussis Adenylate Cyclase — Iodinated adenylate cyclase (20 
jig/ml in buffer A, 5 x 10 s cpm) was supplemented with 2.5 nmol of 
bovine brain CaM and 0.2 Mg/ml TPCK-trypsin. After 10 min of 
incubation at 4 "C (enzyme activity declined by less than 10%), 1 pg/ 
ml soybean trypsin inhibitor was added, and then the reaction mixture 
was loaded onto a 40 x 1-cm column of Ultrogel AcA 44 equilibrated 
with buffer A. Elution was done at a flow rate of 4 ml/h. Fractions 
containing adenylate cyclase activity were pooled and precipitated 
with 5 volumes of cold acetone (— 10 °C). Precipitated proteins were 
separated by centrifugation, solubilized with 9% formic acid, then 
loaded onto a column of 1 x 66 cm of Sephadex G-100 equilibrated 
with 9% formic acid. Elution was done at a flow rate of 6 ml/h, and 
fractions of 1.1 ml were collected. After removal of formic acid by 
evaporation, samples were analyzed for purity by SDS-polyacrylamide 
gel and autoradiography. 

Photoaffinity Labeling — Photo affinity labeling experiments were 
performed at 4 *C for times varying between 1 and 30 min. The 
reaction mixture containing unlabeled adenylate cyclase and azido- 
125 I-CaM or [7- 32 P]Bz;iATP was irradiated with a "long wave" mercury 
lamp (mineral light UVSL 58) positioned at 5 cm from the samples. 
Samples were then run on a 12.5 or 17.5% sodium dodecyl sulfa te- 
polyacrylamide gel electrophoresis (SDS-PAGE) as described by 
Laemmli (17). Gels were either dried or fixed in 10% acetic acid and 
further washed with 25% isopropyl alcohol and 10% acetic acid. For 
detection of radiolabeled peptides on SDS-PAGE, dried gels were 
exposed at -80 "C to Kodak X-Omat AR films for 4-48 h with 
intensifying screens. 

Reactivation of Adenylate Cyclase after SDS-PAGE— Iodinated ad- 
enylate cyclase or its tryptic fragments separated by SDS-PAGE were 
sliced from the gel with a razor blade, rehydrated in case of dried gels, 
and washed extensively with 25% isopropyl alcohol and then with 
10% methanol to remove SDS. Slices in Eppendorf tubes were then 
soaked in 0.8 ml of 8 M urea in 50 mM Tris-HCl, pH 8, containing 
0.1 mM CaCl 2 and 1% Nonidet P-40, for 18 h at 37 °C. Urea solutions 
containing different peptides were supplemented, or not, with bovine 
brain CaM and then dialyzed extensively against buffer A and assayed 
for adenylate cyclase activity as described above. 

Cleavage of Adenylate Cyclase or of Its Tryptic Fragments with iV- 
Chlorosuccinimide and Cyanogen Bromide — Peptide bands identified 
by SDS-PAGE and autoradiography were sliced from the gels, rehy- 
drated when necessary, and then washed with either urea/water/ 
acetic acid (1:1:1, w/v/v) (Trp cleavage) or 70% formic acid (Met 
cleavage) for 20 min. Peptides were cleaved by immersion of the gel 
slices at room temperature or at 37 'C in 15 mM NCS in urea/water/ 
acetic acid for 30 min (Trp cleavage) or in 4% CNBr in 70% formic 
acid for 90 min (Met cleavage) (18-20). After several washings with 
water the slices were equilibrated in 10% glycerol, 5% /?-mercaptoeth* 
anol, 2% SDS, and 62 mM Tris-HCl, pH 6.8, with several changes 
and loaded on a resolving gel. After SDS-PAGE the gels were dried 
and autoradiographed. 

Analytical Procedures — Amino acid analysis was performed on a 
Biotronik amino acid analyzer LC 5001 using a single column proce- 
dure (21). Fluorescence measurements were performed on a Perkin- 
Elmer LS-5B luminescence spectrometer thermostated at 25 °C, using 
a 1 x 1 UV-grade quartz cuvette (sample volume of 2 ml). Emission 
spectra of P^s-jm peptide (\„ c = 290 nm) in the presence or absence 
of Ca 2+ or EGTA were recorded from 300 to 450 nm. Electrophoresis 
under nondenaturing conditions was performed according to Williams 
and Reisfeld (22) using a 13.75% resolution gel. 

RESULTS 

Amino Acid Composition of the 43-kDa Form of B. pertussis 
Adenylate Cyclase — Amino acid analysis of purified prepara- 
tions of bacterial adenylate cyclase is complicated by the fact 
that the enzyme consists of three structurally related peptides 



of 50, 45, and 43 kDa, as judged by SDS-PAGE. The propor- 
tion of these peptides varies greatly from one preparation to 
another. To circumvent this complicating factor we converted 
50- and 45-kDa forms to the 43-kDa peptide by mild digestion 
with trypsin (100:1, w/w ratio) at 4 °C in the presence of a 5- 
fold excess of CaM over adenylate cyclase. The 43-kDa pep- 
tide was then separated from CaM and low molecular mass 
protein fragments by gel filtration under denaturing condi- 
tions. This procedure generated a single species of adenylate 
cyclase suitable for amino acid composition analysis. Alter- 
natively, some preparations of adenylate cyclase were homog- 
enous enough (8) to be suitable for amino acid analysis. Table 
I shows the amino acid composition of the 43-kDa peptide as 
determined from two separate preparations of enzyme. The 
third column in Table I indicates the amino acid composition 
of the peptide encoded by the first 399 amino acids (M r « 
42,939) of adenylate cyclase as predicted from the nucleotide 
sequence of the cya gene. The number of 399 residues was 
arbitrarily chosen based on the closest M, value to 43,000 and 
the fact that Arg-399 is a potential site of tryptic cleavage. 
With few exceptions (Met and Arg) the numbers of all other 
residues were remarkably similar. From these data one may 
assume that the low molecular weight form of B. pertussis 
adenylate cyclase may, indeed, correspond to the first 399 
residues of the large 1706-residue precursor. However, without 
determination of the N-terminal sequence of the protein (our 
attempts to identify it failed, probably due to the fact that 
the N-terminal residue is blocked) other probes had to be 
taken into consideration. 

Selective Tryptophanyl and Methionyl Peptide Bond Cleav- 
age of 43-kDa, T25, and TIB Peptides— According to the 
primary structure of B. pertussis adenylate cyclase as deduced 
from the nucleotide sequence of cya gene corresponding to 
residues 1-399, the partial or complete cleavage of 43-kDa 
peptide at Trp-69 and Trp-242 should generate five fragments 
(Fig. \B). This was indeed the case, as shown by the SDS- 
PAGE pattern of the NCS-digested iodinated protein (Fig. 2). 
The molecular weights of fragments calculated from a cali- 
bration curve of five molecular weight markers (Fig. IB) 
corresponded within experimental error to the expected val- 
ues, with a single exception. The smallest peptide fragment 

Table I 



Amino acid composition of B. pertussis adenylate cyclase (43-kDa 
protein) 



Residue 


Measured* 


Deduced from DNA sequence 
(residues 1-399) 


Cys 


0 


0 


Asn + Asp 


42.8 


42 


Thr 


16.8 


18 


Ser 


24.0 


28 


Gin + Glu 


37.1 


38 


Pro 


12 


15 


Gly 


42.6 


41 


Ala 


48.1 6 


50 


Val 


32.4 


32 


Met 


3.4 


7 


lie 


15.1 


15 


Leu 


28.8 


29 


Tyr 


9.9 


12 


Phe 


13.9 


14 


His 


9.2 


9 


Lys 


13.1 


15 


Arg 


38.1 


32 


Trp 


ND r 


2 



°The values given are uncorrected and determined after 20 h 
hydrolysis in 6 N HC1 at 110*C. 

6 Arbitrarily taken as reference for M t = 43,000. 
*" ND, not determined. 
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Fig. 1. Working model of the structural organization of the 
low molecular mass form of B. pertussis adenylate cyclase 
(43-kDa peptide). The positions of Met and Trp residues in the 43- 
kDa peptide, as deduced from the nucleotide sequence of the first 
quarter of B. pertussis cya gene, are indicated in A. The fragments 
resulting from partial or complete cleavage of adenylate cyclase after 
treatment with NCS, as well as the calculated M, of fragments, are 
indicated in B. The molecular mass determined by SDS-PAGE using 
five proteins of known molecular weight is also indicated above each 
fragment. 
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Fig. 2. Cleavage of 43-kDa, T25 and T 18 peptides with N- 
chlorosuccinimide. 125 I-AdenyIate cyclase {0.2 units, about 4 x 10* 
cpm) in 100 m1 of buffer A was supplemented with 0,4 mm bovine 
brain CaM and 15 ng of TPCK-trypsin. After 10-min incubation at 
4 °C, proteolysis was stopped with 0.1 /ig of soybean trypsin inhibitor 
and aliquots of 5-20 p\ were run on a 12.5% SDS-PAGE. Bands 
corresponding to uncleaved 43-kDa peptide, T25 and T18 were sliced 
from the gel and subjected (+) or not (— ) to treatment with NCS as 
described under "Materials and Methods.** Samples were run again 
on a 17.5% SDS-PAGE, and then the gels were dried and autoradi- 
ographed. The molecular weight markers (arrows on the left) were 
from top to bottom: bovine serum albumin, 67,000 (a); ovalbumin, 
43,000 (fc); carbonic anhydrase, 30,000 (c); soybean trypsin inhibitor, 
20,300 (d); and lysozyme, 14,400 (e). The assignment of different 
peptides {arrows on the right) was made on the basis of their molecular 
weights and the primary structure (residues 1-399) of adenylate 
cyclase as deduced from the nucleotide sequence of the cya gene. 

corresponding to residues 1-69 gave systematically higher 
molecular weight values than expected. This seems to be due 
to an anomalous migration of the 1-69 peptide since its 
complementary fragment (residues 70-399) had a molecular 
weight very close to the expected value. 

Chemical cleavage at the tryptophanyl peptide bond of 
complementary T25 and T18 peptides gave a characteristic 
pattern which allowed assignment of T25 to the N-terminal 
domain of adenylate cyclase and T18 to the C-terminal do- 
main of the enzyme. As shown in Fig. 2, T25 was cleaved by 
NCS into two fragments of apparent M t of 16,600 and 10,800, 
respectively. The smallest of these migrated in SDS-PAGE 
exactly like the smallest peptide resulting from NCS cleavage 
of the 43-kDa peptide, which was previously assigned to 
residues 1-69. After NCS treatment, T18 gave a faint band of 
16.5 kDa differing by only 1 kDa from the parent peptide. 
This indicates that the site of tryptic cleavage of the 43-kDa 
form of adenylate cyclase is slightly upstream of Trp-242. 
From the primary structure of the 43-kDa peptide, as deduced 
from the DNA sequence, it is reasonable to assign the site of 



trypsin cleavage to Arg-235 or Arg-237 (Fig. M). 

To further substantiate the location of T25 and Tl8 frag- 
ments in the primary structure of the B, pertussis adenylate 
cyclase 43-kDa peptide, these fragments were subjected to 
CNBr cleavage. The digestion pattern was more complex than 
in the case of NCS cleavage, due to the presence of seven 
' methionine residues in the 43-kDa peptide. However, T18, 
which contains only 2 methionine residues, should have had 
a much simpler CNBr-cIeavage pattern. This was indeed the 
case (Fig. 3). 

Cross-linking of T25 and T18 Peptides to Azido-™I-CaM 
and Selective Cleavage with NCS — Unlabeled adenylate cy- 
clase complexed to azido- 125 I-CaM was submitted to trypsin 
digestion and then photolysed. The cross -linked products of 
63, 45, and 38 kDa corresponded to the 43-kDa, T25 and T18 
peptides, respectively, covalently bound to azido- 125 I-CaM (6). 
Since CaM has no Trp residues, NCS cleavage of T25-CaM 
complex should tell us which segment of T25, situated before 
or after Trp-69, is involved in CaM binding. As shown in Fig. 
4 {lane 5), after NCS cleavage of T25-CaM cross-linked prod- 
uct, a new radioactive band of 37 kDa appeared which seems 
to indicate that the segment of T25 responsible for CaM 
binding is situated after Trp-69. The cross- linked product of 
T18 and CaM did not generate any visible cleavage product 
after treatment with NCS. 

Reconstitution of Catalytically Active Forms of Adenylate 
Cyclase from Fragments Resulting from Tryptic Cleavage of 
43-kDa Peptide — Even after complete cleavage by trypsin of 
the CaM-complexed 43-kDa peptide, adenylate cyclase re- 
tained 70-80% of its original activity. Gel permeation chro- 
matography of the CaM-T25-Tl8 complex revealed the same 
hydrodynamic properties as the undigested CaM-adenylate 
cyclase complex (6). Upon exposure to 8 M urea for 1 h 
followed by dilution, the CaM-T25-Tl8 complex was found 
to be inactive. However, when the trypsin -digested adenylate 
cycIase-CaM complex was exposed to 8 M urea and dialyzed 
against buffer A, 50% of the initial activity was recovered. 
Gel permeation chromatography of reactivated CaM and ad- 
enylate cyclase fragments again displayed the same hydrody- 
namic properties as the undigested or digested/undenatured 
CaM-adenylate cyclase complex (Fig. 5). These data strongly 
suggest that T25 and T18 refold after removal of urea and 
reassociate in the presence of CaM to yield a ternary T25- 
Tl8-CaM complex which is enzymatically active. 

Since it has been shown that adenylate cyclase can be 
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Fig. 3. Cleavage of 43-kDa, T25, and T18 peptides with 
cyanogen bromide. Iodinated adenylate cyclase was partially 
cleaved with TPCK-trypsin as described in the legend of Fig. 2. 
Peptide bands sliced from the gel were subjected (+) or not (— ) to 
treatment with CNBr as described under "Materials and Methods." 
Samples were run again on a 17.5% SDS-PAGE, and then the gel 
was dried and autoradiographed using different exposure times to 
better visualize bands of different intensities. The molecular weight 
markers were the same as described in Fig. 2. 
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Table n 

Adenylate cyclase activity of T25 and T18 peptides renatured in the 
presence or absence of CaM after SDS-PAGE 
Reactivation of adenylate cyclase after SDS-PAGE is described 
under "Materials and Methods." Adenylate cyclase activity was de- 
termined in the pre sence of 100 nM CaM and 1 mM ATP. 

CaM (UM) during 
renaturation 



5 6 7 



12 3 4 

Fig. 4. Cross-linking of azido- 1M I-CaM to tryptic frag- 
ments of adenylate cyclase and cleavage of cross-linked prod- 
ucts with TV-chlorosuccinimide. Adenylate cyclase (0.2 units) was 
supplemented with 100 nM azido- 125 I-CaM and subjected to trypsin 
proteolysis as described in Fig. 2. Proteolysis was stopped with 
soybean trypsin inhibitor, then samples were irradiated for 1 min as 
described under "Materials and Methods," and run on a 10% SDS- 
PAGE. Radioactive bands corresponding to cross-linked products 
were sliced from the gel, subjected (+) or not (-) to treatment with 
NCS as described under "Materials and Methods," run again on a 
12.5% SDS-PAGE, and then autoradiographed. Lane I, 43-kDa pep- 
tide cross-linked to azido- ,25 I-CaM; lanes 2 and 5, T25 cross-linked 
to azido- 125 I-CaM; lanes 3 and 6, Tl8 cross-linked to azido- 125 I-CaM; 
lanes 4 and 7, azido- ,25 I-CaM. Arrows indicate the same molecular 
weight markers (a to d) as in Figs. 2 and 3. 
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Fig. 5. Gel permeation chromatography of trypsin -cleaved 
adenylate cyclase-CaM complex after urea denaturation and 
renaturation. l25 I-adenylate cyclase (0.09 units; specific activity 3 
units/10 7 cpm l25 I) in buffer A containing 0.6 *tM CaM was digested 
for 10 min with 75 ng of trypsin at 4 *C. The sample (residual specific 
activity, 2 units/10 7 cpm was diluted in 200 m! of buffer A 
containing 8 M urea, 20 ng of soybean trypsin inhibitor, and 20 ^g of 
CaM and incubated 1 h at 30 "C. After an extensive dialysis against 
buffer A at 4 "C, (recovered adenylate cyclase activity, 0.88 units/10 7 
cpm lz5 I), the mixture was loaded onto an Ultrogel AcA 44 column 
(0.6 x 26 cm) equilibrated in buffer A containing 20 jig/ml soybean 
trypsin inhibitor. Fractions of 0.1 ml were collected at a flow rate of 
0.5 ml/h and analyzed for enzyme activity (O) and radioactivity (•). 
Inset shows an autoradiogram after a 12.5% SDS-PAGE of fractions 
corresponding to the peak of activity. The four molecular weight 
markers were: bovine serum albumin, 67,000 (a); ovalbumin, 43,000 
(6); Escherichia coli adenylate kinase, 23,500 (c); and soybean trypsin 
inhibitor, 20,300 (d). AC designates the native adenylate cyclase-CaM 
complex. 

reactivated in a high (about 50%) proportion after SDS-PAGE 
(6), we performed reconstitution experiments with T25 and 
T18 isolated by electrophoresis under denaturing conditions 
(see "Materials and Methods"). T18 renatured alone or in 
association with CaM did not show any cyclase activity. T25 
exhibited a very low but measurable, enzyme activity (ap- 
proximately 0.1-0.2% of the uncleaved protein at saturating 
CaM concentration), irrespective of the presence or absence 



Peptide 



Specific 
activity 



Recovery of 
activity* 







unitsfW cpm 


% 


43 kDa 


Present 


2.83 


54 


T25 


Present 


0.012 


0.22 


T18 


Present 


0.000 


0.00 


T25 + T18 


Absent 


0.026 


0.50 


T25 + T18 


Present 


0.98 


18.6 



a The specific activity of 125 l -adenylate cyclai 
tion was taken as 100% (5.25 units/10 7 cpm). 




Wavelength ( nm ) 

Fig. 6. Ca a *-dependent binding of synthetic Pass-as* peptide 
to CaM as shown by fluorescence and gel retardation analysis 
(inset). Emission spectra of Pzus^m (5 mM) in 20 mM Tris-HCl, pH 
7.5, 0.1 M NaCI, and 0.2 mM CaCl* were recorded from 300 to 450 
nm. A, synthetic peptide alone; B, synthetic peptide plus 5 bovine 
brain CaM; C, same as B f after addition of 2 mM EGTA. Inset shows 
the electrophoretic behavior of CaM (5 *ig) under native conditions 
in the presence of P^2&* peptide and excess Ca 2 * (0.2 mM) or EGTA 
(2 mM). Lanes 1 and 3, no peptide present; lane 4, peptide/CaM 
(molar ratio), 0.5:1; lane 5, peptide/CaM, 1:1; lanes 2 and 6\ peptide/ 
CaM, 4:1. 



of CaM during renaturation. Assuming that CaM activation 
of S. pertussis adenylate cyclase is between 20- and 50-fold, 
the catalytic activity of T25 might be estimated as between 2 
and 10% of that of the whole 43-kDa peptide. When a urea 
solution containing complementary fragments was mixed with 
CaM prior to the dialysis step, adenylate cyclase activity was 
recovered to a significant level (Table IT). If CaM was omitted 
during renaturation of T25 and T18, recovery of catalytic 
activity was considerably lower. 

From these results one may assume that the catalytic site 
of adenylate cyclase is located on the T25 domain, but full 
catalytic activity is expressed by a structure in which T25 and 
T18 are "bridged" together by CaM. The values of the appar- 
ent K m for ATP of the native 43-kDa form, of the enzyme 
renatured after SDS-PAGE or of the reconstituted T25-T18- 
CaM complex were between 0.6 and 0.8 mM. 

Interaction of the Synthetic Peptide P235-254 with Bovine 
Brain Calmodulin— Although it is difficult to identify a typical 
CaM-binding site on the sequence of B. pertussis adenylate 
cyclase like the site described for skeletal or smooth muscle 
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Fig. 7. Reversal of the CaM-dependent activation of B. per- 
tussis adenylate cyclase by synthetic peptide P23&-364. CaM 
diluted to desired concentrations was added in 5-/xl aliquots to the 
reaction mixture described under "Materials and Methods" (•). 
When present, P 235-254 was at 0.5 mm (♦) and 5 (A), The reaction 
was initiated by the addition of adenylate cyclase (0.5 ng/assay). The 
fraction of maximal activation was calculated as (V — V 0 )/(V m „ — 
V 0 ), where Vo and are the reaction rates in the absence of CaM 
or in the presence of saturating concentrations of CaM; V is the 
reaction rate at a given concentration of CaM. The dissociation 
constant of synthetic peptide-CaM complex (580 nM) was estimated 
from the midpoints of the activation curves as described by Erickson- 
Viitanen and De Grado (26), 

myosin light chain kinase (23, 24), the sequence corresponding 
to residues 235-254 seems to be a candidate for such a CaM- 
binding site because: (i) this sequence belongs to the N 
terminus of the T18 peptide and joins the T25 fragment of 
adenylate cyclase via Arg-235 (or Arg-237); (ii) azido- 125 I- 
CaM preferentially photolabels the T18 tryptic domain of 
bacterial enzyme; (iii) this sequence has a Trp in its first half 
and shows a strong predominance of basic and hydrophobic 
residues (25). 

The fluorescence emission spectrum of synthetic peptide 
P235-254 had a maximum at 350 nm (Fig. 6) and was not 
influenced by the presence of Ca 2+ or EGTA. Addition of 
CaM in a 1:1 ratio to P235 2s< in the presence of 0.2 mM Ca 2+ 
caused a 1.82-fold increase in total fluorescence intensity and 
a shift of the spectral maximum to 325 nm. EGTA promoted 
dissociation of the CaM-P235-2M complex. The Ca 2+ -dependent 
CaM -binding properties of P235-2M were confirmed by electro- 
phoresis of CaM-peptide complex under nondenaturing con- 
ditions. In the presence of Ca 2+ , P235-254 retarded the migration 
of CaM relative to that achieved in the presence of EGTA 
(Fig. 6, inset). The synthetic peptide P235-2W reversed CaM- 
dependent activation of B. pertussis adenylate cyclase (Fig. 
7). The relatively high K d value of P235-2&4 for CaM (580 nM) 
as compared to other CaM-binding sequences (23-29) may 
indicate that it represents only part of a high affinity binding 
domain. 

DISCUSSION 

Calmodulin regulates the activity of a large number of 
enzymes with various catalytic functions and biological sig- 
nificance. The molecular basis of the interaction of CaM with 
target proteins is beginning to be undeVstood on the basis of 
two independent approaches: (i) isolation of CaM-binding 
fragments from enzymatic or chemical digests of target en- 
zymes (25, 28, 30); (ii) synthesis of various peptides able to 
interact with CaM in a Ca 2 * -dependent fashion (27, 31, 32). 
A consensus emerging from these studies is that the activator- 
binding domains of CaM-dependent enzymes show only a 



limited degree of strict sequence homology but form basic 
amphiphilic helices, in agreement with earlier observations 
(33-35). 

An important advantage in approaching the location of 
catalytic and CaM-binding domains of R pertussis adenylate 
' cyclase is the high specific activity of the enzyme, the ease of 
its reactivation after different treatments including SDS- 
PAGE, and knowledge of the nucleotide sequence of the B. 
pertussis cya gene. Thus, we were able to show in agreement 
with genetic arguments (5) that the catalytically active, low 
molecular weight form of adenylate cyclase (present in bac- 
terial culture supernatant) corresponds to the first quarter of 
the coding sequence of B. pertussis cya gene. Selective cleavage 
of the protein by trypsin yielded two fragments, one corre- 
sponding to the N-terminal domain carrying the catalytic site 
(T25), the second corresponding to the C-terminal domain, 
which is responsible for the interaction with CaM (T18). 

Location of the CaM-binding site of B. pertussis adenylate 
cyclase around Trp-242 was suggested by photoaffinity label- 
ing experiments of protein with azido- 125 I-CaM, as well as by 
the fact that synthetic peptide corresponding to residues 235- 
254 interacted with CaM in a Ca 2+ -dependent manner. How- 
ever, the affinity of the synthetic peptide for CaM, compared 
to other well known CaM-binding sequences, was unexpect- 
edly low to make certain such an assignment. Site-directed 
mutagenesis of Trp-242 with Arg, Gly, and Asp was helpful 
in confirming the location of CaM-binding site of adenylate 
cyclase. All three mutants were fully active, but affinity for 
CaM was decreased by a factor varying between 30 and 1000. 2 
One might envisage that the spatial relationship of the cata- 
lytic site and the CaM-binding site of B. pertussis adenylate 
cyclase differs from that of other CaM-dependent enzymes. 
The particular organization of the CaM-binding site of B. 
pertussis adenylate cyclase might also explain why the recon- 
stitution of catalytically active species of adenylate cyclase 
from isolated T25 and T18 fragments requires the presence 
of CaM, If we admit the following sequence of interactions: 



T18 + CaM ^ Tl8-CaM 
Tl8-CaM + T25 ^ T25-Tl8-CaM 



(a) 
(b) 



where the binary complex is of low stability and the ternary 
complex is of high stability, the limiting step in the formation 
of the catalytically active species is the formation of the T18- 
CaM complex. 

Attempts to locate the catalytic site on T25 peptide using 
the photoactivatable ATP analog Bz 2 ATP failed. Although 
irreversible binding of [7- 32 P]Bz 2 ATP to the enzyme upon 
photolysis was demonstrated, the photolabel was too unstable 
under conditions of protein cleavage. By comparing the se- 
quence situated between residues 54 and 70 of B. pertussis 
adenylate cyclase with the well known motif Gly-X-X-X-X- 
Gly-Lys-Thr-(Ser) in ATP-binding proteins (36-38), one may 
surmise that the polyphosphate binding site in B. pertussis 
adenylate cyclase is located near Lys-65. This lysine belongs 
to a 0-turn segment, as predicted from the primary structure 
of the protein using the algorithm of Gamier et al. (39). 
Similar sequences in ATP-binding proteins form glycine -rich 
flexible loops terminated by a lysine which, by changing its 
conformation and location, may control access to the MgATP 
site (36). Replacement of Lys-65 in B. pertussis adenylate 
cyclase with Gin yielded mutant enzyme exhibiting only 0.1% 
of catalytic activity of wild type protein and intact CaM- 
binding properties. 2 

2 Glaser, P., Elmaoglou-Lazaridou, A., Krim, E., Ladant, D., Barzu, 
% 0., and Danchin, A. (1989) EMBO J. t in press. 
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In order to identify molecular features of the calmodulin 
(CaM) activated adenylate cyclase of Bordetella pertussis y 
a truncated cya gene was fused after the 459th codon in 
frame with the ct-lacU gene fragment and expressed in 
Escherichia colL The recombinant, 604 residue long 
protein was purified to homogeneity by ion-exchange and 
affinity chromatography. The kinetic parameters of the 
recombinant protein are very similar to that of adenylate 
cyclase purified from B.pertussis culture supernatants, 
i.e. a specific activity >2000 ^mol/min mg of protein at 
30°C and pH 8, a A* 1 ? of 0.6 mM and a K d for its 
activator, CaM, of 0.2 nM. Proteolysis with trypsin in 
the presence of CaM converted the recombinant protein 
to a 43 kd protein with no loss of activity; the latter corre- 
sponds to the secreted form of B.pertussis adenylate 
cyclase. Site-directed mutagenesis of residue Trp-242 in 
the recombinant protein yielded mutants expressing full 
catalytic activity but having altered affinity for CaM. 
Thus, substitution of an aspartic acid residue for Trp-242 
reduced the affinity of adenylate cyclase for CaM > 1000- 
fold. Substitution of a Gin residue for Lys-58 or Lys-65 
yielded mutants with a drastically reduced catalytic 
activity (-0.1% of that of wild-type protein) but with 
little alteration of CaM-binding. These results sub- 
stantiated, at the molecular level, our previous genetic 
and biochemical studies according to which the N- 
terminal tryptic fragment of secreted B.pertussis 
adenylate cyclase (residues 1-235/237) harbours the 
catalytic site, whereas the C-terminal tryptic fragment 
(residues 235/237-399) corresponds to the main CaM- 
binding domain of the enzyme. 

Key words: ATP-binding site/ Bacillus anr/zracw/calmodulin- 
binding site/cyciolysin 



Introduction 

Bordetella pertussis adenylate cyclase is an intensely studied 
enzyme because of its potential role in the pathogenesis of 
whooping cough (Weiss and Hewlett, 1986). Two properties 
of adenylate cyclase from this organism make it an attractive 
model for structure -function relationship studies: (i) its 
activation , by calmodulin (CaM) which is not known to 
occur in bacteria (Wolff et al., 1980; Greenlee et aL, 
1982); and (ii) its ability to enter eukaryotic cells causing 
unregulated synthesis of adenosine 3' -5 '-monophosphate 



(cAMP) and impairment of normal cellular function (Confer 
and Eaton, 1982; Hanski and Farfel, 1985). Delineation of 
the mechanism of activation and catalysis in B.pertussis 
adenylate cyclase wouid provide the first instance where the 
active site of this heterogeneous group of enzymes could be 
characterized. 

Using the interaction between adenylate cyclase and CaM 
as a tool we have recently cloned the corresponding gene 
from B.pertussis , determined its nucleotide sequence and 
expressed it in Escherichia coli (Glaser et aL, 1988a). 
We have found that the protein is synthesized as a large 
Afunctional precursor form of 1706 amino acid residues, 
endowed with adenylate cyclase and haemolytic activity. The 
protein has been named cyclolysin to indicate this fact. Its 
secretion mechanism involves the haemolytic carboxy- 
terminai end (Glaser et aL, 1988b) in conjunction with 
the product of three genes located downstream from the 
adenylate cyclase gene. In B.pertussis, the large precursor 
form is processed to low molecular mass forms of 43, 45 
and 50 kd present in B.pertussis culture supernatant 
(Shattuck et at., 1985; Ladant et al, 1986). The secreted 
adenylate cyclase(s) corresponds to the amino-terminal 
domain of the large precursor form. By combining genetic 
and biochemical information (Glaser etaL, 1988a; Ladant, 
1988; Ladant etaL, 1989), we showed that the N-terminal 
tryptic fragment of the 43 kd form of adenylate cyclase (T25, 
residues 1 —235/237) harbours the active site, whereas the 
C-terminal tryptic fragment (T18, residues 235/237-399) 
corresponds to the main CaM-binding domain of the enzyme. 
Furthermore, a synthetic peptide corresponding to residues 
235—254 of adenylate cyclase was shown to bind to CaM 
in a Ca 24 "-dependent manner (Ladant et aL, 1989). 

In the present study, we examined the catalytic and 
CaM-binding properties of B.pertussis adenylate cyclase 
expressed in E.coli by a plasmid carrying a truncated cya 
gene which comprises the entire cyclase domain of cyclolysin 
fused to the a domain of /3-galactosidase. Site-directed 
mutagenesis of Lys-58, Lys-65 and Trp-242 revealed the 
role of these residues in catalysis or in binding of CaM. 



Results 

Expression and purification of a truncated B.pertussis 
cya gene product in E.coli 

Several clones deleted at the 3' end of the cyoA gene were 
obtained as side products of the sequence analysis of the 
B.pertussis adenylate cyclase gene using the cyclone strategy. 
Clones encoding enzyme with a M r close to that of 
adenylate cyclase purified from B.pertussis culture super- 
natants were transferred into the expression vector pTZ19 
(Pharmacia, Uppsala, Sweden). One such clone, plasmid 
pDIA5202, gave the highest level of adenylate cyclase 
activiry. As indicated in Figure 1, it fused sequences 
corresponding to the whole adenylate cyclase domain of 
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Table I. Purification of recombinant B.pertussis adenylate cyclase 



pDIA5202 



60 
120 
180 
240 
300 
360 
420 
4S0 
540 
600 



vpig* 

Fig. 1. Restriction map of plasmid pDtA5202 and deduced amino acid 
sequence of the recombinant adenylate cyclase. The 5 '-terminal end of 
the cyclolysin gene has been fused in phase with a truncated lacZ gene 
containing a sequence specifying the a domain of 0-galactosidase. This 
resulted in a chimeric protein containing the adenylate cyclase domain 
of cyclolysin (residues I -399), the start of the haemolysin domain 
(residues 400-459) and the ot domain of jS-galactosidase (residues 
460 -604, in lower case letters). The Arg pair (399/400) is 
particularly sensitive to trypsin action resulting in formation of a 43 kd 
protein, identical to the small adenylate cyclase form present in 
B.pertussis supernatants. The black box corresponds to the inserted 
B.pertussis chromosomal DNA. The truncated lacZ gene is shaded. 
Restriction sites are represented as follows: B, BamHl; E, EcoRl: V, 
£<roRV; S, Sail. Amino acid residues are represented by the standard 
one-tetter code. Residues submitted to site-directed mutagenesis (K58, 
K65 and W242) are boxed. 

1 2 



a 

b 

c 

d— - 

e 

f 



Fig. 2. SDS-PAGE (12.5%) of purified recombinant adenylate 
cyclase expressed in E.colL Lane 1: M r markers: a, phosphorylase a, 
(94 000); b, bovine serum albumin (67 000); c, ovalbumin (43 000); 
d. carbonic anhydrase (30 000): e. soybean trypsin inhibitor (20 300); 
f, lysozyme (14 400). Lane 2: purified adenylate cyclase (2 fig of 
proteins). Gels were stained with Coomassie Blue. 

cyclolysin and the or domain of 0-galactosidase. Since the 
transcription and translation signals were identical in different 
plasmids, this higher activity was probably due to a better 
stability of the protein. This clone was therefore kept for 
further studies. 

The maximum level of adenylate cyclase activity was 
reached when cells entered the early stationary phase, after 
which a decrease of activity was noted, probably due to the 
degradation of the protein. During the exponential phase of 
growth, the ratio of /3-galactosidase to adenylate cyclase 
activity was constant, as expected for a-complementatio'n 
(interaction between peptide a carried by the recombinant 
adenylate cyclase and the partially deleted 0-galactosidase 
coded by the /acZM15 gene). The two enzyme activities 



Step 


Protein 
(mg) 


Total activity 
(fimol/min) 


Specific activity 
(/imol/min mg 
of protein) 


Yield 
(%) 


Whole cell extract 
8000 g sediment 


889 
138 


2410 
1968 


2.7 
14.2 


100 
82 


in 8 M urea 
DEAE-Sephacel 

chromatography 
CaM-agarose 

chromatography 


17.8 
0.35 


1032 
738 


60.0 
2104 


43 
31 



Bacteria from 2 I of culture were suspended in 50 ml of 20 mM 
K-phosphate (pH 7.4) then disrupted by ultrasound. The extract was 
centrifuged at 8000 g for 30 min and the sediment resuspended in 
20 ml of 8 M urea in Tris-Triton buffer [50 mM Tns-HCl, pH 8 
plus 0.1% (w/w) Triton X-100]. After removal of insoluble material 
by centrifugal at 13 500 g for 30 min. the 'urea extract was loaded 
onto a DEAE-Sephacel column (20 mg of protein/ml of swollen gel) 
at a flow rate of 30 ml/h. The column was washed with 150 ml ot 
Tris-Triton buffer for 1.5 h. then adenylate cyclase was eluted with 
05 M NaCI in the same buffer. Fractions containing adenylate cyclase 
were supplemented with 5 mM CaCfe then purified on CaM-agarose 
as described by Monneron et at. (1988). 
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Fie. 3. Trypsin digestion of recombinant adenylate cyclase in the 
presence and absence of CaM. (A) Seven units of purified recombinant 
adenylate cyclase in 100 ,d of Tris-Triton buffer (50 «^T«-H.a 
pH 8 plus 0.1% Triton X-100) supplemented with I mM CaCl^ and 
50 ng TPCK-trypsin were incubated at 4°C in the presence of 2 jiM 
CaM ( • ■) or 0.5 mg/ml human serum albumin ( ♦ .). At different 
time intervals 5 *J aliquots were withdrawn and diluted in 200 p. of 
Tris-Triton buffer containing 2 pg of soybean trypsin inhibitor ( • , 
♦ ) or in 35 id of 8 M urea in the same buffer (■). After 10 min at 
4°C samples in urea were diluted to 200 pi with Tris-Tnton buffer, 
then residual adenylate cyclase activity (expressed as percentage ot 
initial activity) was determined. Controls run in the absence of trypsin 
and diluted with Tris-Triton buffer or 8 M urea Tris-Triton showed 
no decreased enzyme activity. (B) 0.1 units of pure ['-^adenylate 
cyclase (7 x 10 5 c.p.m.) in 50 pi of Tris -Triton buffer containing 
I mM CaCU were submitted to TPCK-trypsin (25 ng) proteolysis for 
15 s (lane 3), 3 min (lane 4), 5 min (lane 2) or 7 min (lane 5) in the 
absence (lane 2) or in the presence of 0.15 pM CaM (lanes 3, 4 and 
5). Proteolysis was stopped with an excess of soybean trypsin inhibitor 
(2 pg/sample). Residual adenylate cyclase activity (expressed as 
percentage of initial activity on the top of the gel lanes) was 
determined for each sample. The remaining samples were run onto 
12 5% (w/v) SDS-PAGE and autoradiographed. The arrows to the 
right indicate the positions of the undigested protein (lane 1, 100% 
activity), the 43 kd form of adenylate cyclase, and the two 
complementary peptides resulted from trypsin digestion of B.pertussis 
adenylate cyclase (25 and 18 kd. respectively). 
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Fig. 4. A helix wheel representation of the putative CaM-binding 
sequence situated between residues 235 and 246 in B.pertussis 
adenylate cyclase. 



Table H. Adenylate cyclase acti vity of crude extracts 
Strain 



Activity (nmol/min me of protein ) 



10 nM CaM 



2 fiM CaM 



WT (5202) 


2610 


2540 


W242D 


87 


1330 


W242G 


480 


2220 


W242R 


1465 


2430 


W242V 


2530 


2820 


K65Q 


1.51 


1.40 


K5SQ 


0.55 


0.59 



„„ „,o.u F . cu uy unrasouna ana me extract was diluted with 

8 M urea in Tns —Triton buffer to "unmask" cryptic adenylate cyclase 
□cavity. K65Q and K58Q muunt extracts were analysed for enzyme 
activity using incubation times of I h instead of 5 or 10 min. 

were recovered in the pellet fraction after cell breakage and 
centrifugation. 

Formation of adenylate cyclase -£-galactosidase aggre- 
gates was found to be helpful in enzyme purification, 
insoluble proteins once treated with urea in buffer solution 
contained >80% of the original adenylate cyclase activity 
Chromatography on DEAE-Sephacel allowed removal of 
the chaotropic agent and recovery of a fully active enzyme 
by elution with salt. Finally, affinity chromatography on 
CaM-agarose yielded pure adenylate cyclase (Table I and 
Figure 2). The mol. wt of the protein calculated from a 
calibration curve is at 60 kd, somewhat lower than that 
S, f ^n^ d I r0m the deduced sequence of the protein (M r = 
64 995). Several faint bands, visible between 60 and 43 kd 
correspond most probably to products of cleavage of the 
recombinant protein by endogenous proteases. Antibodies 
directed against adenylate cyclase of B. pertussis (Monneron 
et ai., 1988) recognized these polypeptides in Western blots 
(data not shown). 

Characterization of the B.pertussis adenylate cyclase 
expressed in E.coli 

E.coli carrying the truncated gene of B.pertussis adenylate 
cyclase expressed not only an insoluble, but also a cryptic 
form of the enzyme after growth at 37°C, most probably 
due to the fact that adenosine 5 '-triphosphate (ATP) and/or 
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Fig. 5. Activation by calmodulin of wild-type (□) and different mutant 
forms (W242V, ■; W242R. W242D. A; W242G. O; and 
K65Q, A) of recombinant adenylate cvclase expressed in E.coli 
Bovine brain calmodulin (100 pM stock solution) was diluted to the 
desired concentrations with Tris -Triton buffer con tain in e 1 mg/ml 
human serum albumin and then added in 5 /il-aliquots to~the reaction 
medium described in Materials and methods. CaCI, was present at 
0.1 mM. The reaction was initiated by the addition of adenylate 
cyclase followed by 10-60 min incubation at 30°C. The fraction of 
maximal activation was calculated as (^-^oVC^max - ^ ) where V 
and ^max are the reaction rates in the absence of CaM or in the 
presence of saturating concentrations of CaM. respectively; V is the 
reaction rate at a given concentration of CaM. The wild-type and 
K65Q mutant enzymes were used as purified preparations; the Trp 
mutants were derived from crude bacterial extracts. 

CaM were less accessible to the cyclase moiety of the 
complemented protein aggregates. Dilution of crude extracts 
with 8 M urea instead of buffer prior to activity assays 
unmasked this cryptic activity. Trypsin rapidly inactivated 
the recombinant adenylate cyclase in the absence of CaM 
as was the case with enzyme secreted from B.pertussis 
(Figure 3A). Proteolysis performed in the presence of CaM 
converted the 60 kd protein to a 43 kd form with no loss of 
activity. Further proteolysis of the 43 kd form yielded the 
complementary fragments T25 and T18 (Ladant, 1988), still 
maintained in an active native-like structure by CaM (Figure 
3B). It is therefore clear that the a peptide of /3-galactosidase, 
as well as residues between 400 and 458 of recombinant 
adenylate cyclase, could easily be removed from the protein 
by tryptic digestion with no significant loss of activity. The 
kinetic parameters of the recombinant protein are very 
similar to those of adenylate cyclase purified from B. 
pertussis culture supernatants, i.e. a specific activity 
>2000 ^mol/min mg of protein at 30°C and pH 8, a K m 
for ATP of -0.6 mM and a K d for CaM of 0.2 nM in the 
presence of Ca- + ions. Ethylene to(oxyethylenenitrilo) 
tetraacetic acid (EGTA) decreased the affinity of recombinant 
adenylate cyclase for CaM by about two orders of 
magnitude, as was the case with enzyme secreted from 
B.pertussis (Ladant, 1988), while the maximum catalytic 
activity was increased by a factor of 1.7 (data not shown). 

Site-directed mutagenesis of Trp-242 in recombinant 
adenylate cyclase 

Attempts to identify the CaM-binding site of B.pertussis 
adenylate cyclase by sequence comparison with well-charac- 
terized CaM-activated enzymes was rather inconclusive. 
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Fig. 6. Alignment of the putative poly phosphate-binding sites of 
B.pertussis adenylate cyclase with sequences belonging to well- 
characterized ATP-binding enzymes. 

However, the sequence situated around Trp-242 in the 
bacterial enzyme (Figure 4) had some features characteristic 
of CaM-binding peptides such as presence of basic and 
hydrophobic residues forming amphiphilic a-helix structures 
(Blumenthai etaL, 1985; Buschmeier etaL, 1987; Kennelly 
et al 1987; O'Neil et aL, mi; Glaser et aL, 1988a; 
Hanley et aL, 1988; James et al., 1988). Since Trp is 
frequently present in CaM-binding sequences, Trp-242 was 
chosen as the first target for analysis of B.pertussis adenylate 
cyclase by site-directed mutagenesis. 

As shown in Table H replacement of Trp-242 in wild-type 
adenylate cyclase with Val, Arg, Gly or Asp (W242V, 
W242R, W242G and W242D mutants) had little effect 
on enzyme activity in crude extracts, as long as CaM 
concentration was well above saturation. However, at 
10 nM CaM (concentration at which the activator is almost 
saturating for the wild-type enzyme) the four mutants 
displayed considerable differences in activity: mutant W242V 
was the least affected (90% of maximal activity at 10 nM 
CaM), whereas mutant W242D was the most affected (only 
6.6% of maximal activity at 10 nM CaM). Analysis of CaM 
activation curves for the four Trp mutants showed that the 
half-maximal concentration required for activation was of 
0.5 (W242V), 6.3 (W242R), 95 (W242G) and 350 nM 
(W242D), respectively, as compared to a value of 0.20 nM 
for the wild-type protein (Figure 5). As expected, the K m 
for ATP of all mutants showed no significant difference when 
compared with that of the wild-type enzyme. 

Two modified forms of adenylate cyclase (W242V and 
W242D) were purified to homogeneity to determine whether 
mutation had affected the maximum catalytic activity. The 
specific activity of the adenylate cyclase W242V was very 
close to that of the wild-type protein (2200 /tmol/mm 
mg of protein), whereas mutant W242D had a V m 30% 
lower than the wild-type enzyme. It is possible that under 
experimental conditions the latter mutant would not be 
saturated by the activator. Determination of CaM activation 
curves of purified enzymes gave the same K d values as 
those obtained with enzyme from crude extracts. 

Site-directed mutagenesis of Lys-58 and Lys-65 in 
recombinant adenylate cyclase 

Previous experiments showed that isolated T25 fragment 
(residues 1-235/237) of B.pertussis adenylate ' cyclase 
exhibited a low but measurable enzyme activity, which 
indicated that it harboured the catalytic site (Ladant et al., 
1989). In addition, it was observed, by comparison with 
another CaM-activated enzyme, Bacillus anthracis adenylate 
cyclase (Escuyer et al, 1988), that a 17 amino acid poly- 
peptide was similar in both proteins and contained the 

sequence G — GKS (AKS in B.pertussis) (Figure 6) 

which is known to be part of many ATP-binding proteins 
(Walker et aL, 1982; Miller and Amons, 1985; Fry et aL, 



1986). This suggested that the nucleotide-binding site might 
be located near Lys-65. In order to substantiate this 
hypothesis we substituted Gin for Lys-65. Another basic 
residue, Lys-58, situated in the vicinity of the former Lys 
residue was also submitted to the same substitution. Both 
mutants (K65Q and K58Q) showed drastically reduced 
catalytic activity in crude extracts as compared to wild-type 
protein at saturating or near-saturating concentrations of CaM 
(Table II). As a first approximation we can assume that both 
mutants were much less, or not at all, affected in their 
CaM-binding properties. Pure K65Q mutant adenylate 
cyclase exhibited a specific activity of 2.2 /imol/min mg of 
protein when assayed with 2 mM ATP and 100 nM CaM. 
Determination of enzyme activity as a function of ATP and 
CaM concentration indicated a K m for ATP of 1.5 and 
half-maximum concentration required for activation by CaM 
of 0.5 nM. The latter value is most probably an overestimate 
since enzyme concentration in the assay mixture was of the 
same order of magnitude to ensure accurate determination 
of its activity. As in the case of wild-type or W242 mutants, 
the adenylate cyclase K65Q was converted by proteolysis 
with trypsin into complementary T25 and T18 peptides in 
the presence of CaM (data not shown). 

Discussion 

Although adenylate cyclases from various organisms have 
been the subject of intense study for the past 15 years not 
much is known about the detailed molecular structure of 
their catalytic or regulatory centres. The genes coding for 
adenylate cyclase from two very different bacteria, B. 
pertussis (a Gram-negative organism) and B. anthracis 
(a Gram-positive bacillus), were cloned and sequenced 
(Escuyer et aL, 1988; Glaser et aL, 1988a; Mock et aL, 
1988) We observed that these enzymes although having 
similar properties, displayed only limited sequence similarity, 
except in three regions. The first region (situated between 
residues 54-70 in B.pertussis enzyme and between residues 
342-358 in B.anthracis adenylate cyclase) contained a 
sequence that resembled the generally accepted binding site 
for ATP (Walker et aL, 1982; Miller and Amons, 1985; 
Fry et aL. 1986) (Figure 6). We have demonstrated here 
in the case of B.pertussis adenylate cyclase that, indeed, this 
region must be closely involved in the enzyme active centre. 
From amino acid sequence comparison we can therefore 
confidently infer that the cognate site is also involved m the 
catalytic site of B.anthracis. The data presented are also m 
line with biochemical experiments using the pure enzyme 
fragments and suggesting that the catalytic centre was located 
in the first half of the protein (residues 1 -235/237) secreted 
in B.pertussis culture supernatants (Ladant et aL, 19$9). 

The case of the CaM regulator site is more complex. When 
analysing the amino acid sequence of the protein, as predicted 
from the nucleotide sequence, we were prompted to suggest 
a tryptophan-containing peptide (Trp-242) might be involved 
in CaM-binding as it is in some known cases (Glaser et aL, 
1988a). However, comparison with the B.anthracis enzyme 
did not display any significant primary structure similarity 
in the corresponding region. It was observed, however, that 
the region was bracketed by two regions of strong similarity 
(Escuyer et aL, 1988). This suggested either that we are in 
the presence of regions flanking a polypeptide fold where 
CaM is accommodated, and/or that one of these regions 
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contacts CaM while the other one would be involved in the 
information transfer between the CaM-binding site and the 
catalytic centre. In any case the fact that alteration of Trp-242 
results in a much lower CaM-binding demonstrates that this 
region must be somehow involved in the association between 
the two proteins. The fact that there is no conservation of 
the corresponding primary structure indicates that it is a 
secondary rather than primary structure which is important 
for binding. It should be emphasized here that our results 
are the first experimental data where the putative a-helix 
involved in CaM-binding has been modified, and thus shown 
to influence binding. 

In keeping with this hypothesis are the results of the 
specific mutation effects we observed. Aspartate, when 
replacing Trp diminishes drastically CaM interaction; it is 
known that this amino acid residue, perhaps through its 
specific interaction with water molecules, is often a helix 
breaker, whereas aromatic amino acid residues (namely Trp 
and Phe), are helix formers. A branched-chain amino acid 
such as Val should be, in this context, more like Trp and 
Gly more like Asp, as they indeed are. Thus we propose, 
in line with Cox et al. (1985) that an a-helix is part of the 
CaM-binding site of bacterial pathogen adenylate cyclase, 
as is the case for other CaM-binding proteins. This could 
correspond to a dipolar moment orienting CaM in a correct 
position with respect to the enzyme; the conserved sites 
would in such a picture be more likely to correspond to zones 
of contact between the enzyme and CaM. 

Materials and methods 

Chemicals 

Adenine nucleotides, restriction enzymes and T 4 DNA ligase were from 
Boehringer Mannheim. L-l-tosylarnido-2-phenyleihyl chloromediyl ketone 
(TPCK) trypsin, soybean trypsin inhibitor, bovine brain CaM and CaM- 
agarose were from Sigma. Oligonucleotides were synthesized according to 
the phosphamidinate method using a commercial DNA synthesizer (Applied 
Biosystems). [a- 32 P]ATP (3000 Ci/mmoi). [ 3 HJcAMP {40 Ci/mmol) and 
Na l25 I (1000 Ci/mmo!) were obtained from the Radiochemical Centre 
(Amersham. UK). 

Bacterial strain and growth conditions 

E.coli strain BMH7118 was used in this work (Gronenbrun, 1976). This 
strain expresses a lacl q gene and a deleted lacL gene, /acZMI5. The 
product of the First gene represses the lactose promoter in the absence of 
the inducer (isopropyl-/3-D-thiogalactoside) (IPTG), whereas the product of 
the second gene, M15, is an acceptor in ^-complementation (Ullmann and 
Perrin, 1970). Cultures were performed in LB medium (Miller, 1972) 
supplemented with 100 /xg/ml ampicillin, and when necessary 0. 1 mM IPTG 
until the end of the exponential phase of growth. Bacteria were harvested 
by centrifugation for 30 min at 5000 g, then disrupted by sonication 
(2 x3 min at 20 KHz and 100 warts) after being resuspended in 20 mM 
K-phosphate buffer (pH 7.4). 

Plasmids 

Plasmid pDIA5202 harbours a truncated cya gene, fused in frame after the 
459th codon with the a-lacZ* gene (Figure 1). The fusion gene encodes 
a 604 residue long protein. The fusion to the a segment of /J-galactosidase 
increased the stability of the adenylate cyclase, allowing a simplified 
purification of the protein. 

Site-directed mutagenesis and sequence analysis 
Oligonucleotide directed mutagenesis was performed using the Amersham 
kit following the supplier's instructions. The Lys (AAA) codon at position 
58 was modified to a Gin (CAA) codon using the oligonucleotide: 
GTGGCCACC£AAGGATTGG. The Lys (AAG) codon at position 
65 was modified to a Gin (CAG) codon using the oligonucleotide 
GTGCACGCCCAGTCGTCCG. Both modifications were performed 
on the BamHl-EcoRV DNA fragment cloned in phage M13tgl30 
(Nakamaye and Eckstein, 1986). The Trp (UUG) codon at position 242 



was first modified to a Asp (GAC) codon using the oligonucleotide 
CTTGTTGGACAAAATCGC. This Asp codon was then modified to a Val 
(GTC) codon using the oligonucleotide CTTGTTGGTCAAAATCGC. 
Finally the Trp codon was also modified in a single step to either Gly 
(GGG), or Arg (CGG) codons using the ambiguous oligonucleotide 
GACTTGTTG(C/G)GGAAAATCGC. The modifications at position 242 
were performed on the EcoRV-EcoRl DNA fragment cloned in phage 
M13tgl30. For each mutagenesis the mutated fragment was controlled for 
the absence of any other mutations by the dideoxy nucleotide sequencing 
method (Sanger et a/., 1977). 

Analytical procedures 

Adenylate cyclase activity was measured as previously described (Ladant 
et al., 1986) in 100 ft\ of medium containing 50 mM Tris-HCl (pH 8), 
6 mM MgCU, 0.1 mM CaCl 2 (or 2 mM EGTA), bovine brain CaM 
(between 0.03 nM and 10 /tm), 0.1 mM [ 3 H]cAMP (10 4 c.p.m./assay) and 
2 mM [a 32 PJATP (5 x 10 5 c.p.m./assay). One unit of adenylate cyclase 
activity corresponds to 1 /imoi of cAMP formed in 1 min at 30 °C and pH 
8. /3-gaiactosidase activity was measured according to Pardee et al. (1959). 
Iodination of adenylate cyclase (0.08—0.25 mol of l25 I/mol of enzyme) and 
cleavage of protein by trypsin were performed exactly as previously described 
(Ladant et al., 1988). Cleaved products were analysed by SDS-PAGE 
as described by Laemmli (1970) and detected by exposure of dried gels 
at — 80°C to Kodak X-O-Mat AR films for 4-48 h with intensifying screens. 

Protein concentration was measured according to Bradford (1976) or by 
amino acid analysis on a Biotronik amino acid analyzer LC 5001 using a 
single column procedure. Samples containing 10— 15 pg of protein were 
hydrolysed in vacuum in 0. i ml of 6 M HCI for 20 h at 1 10°C. 
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ABSTRACT We describe a bacterial two-hybrid system 
that allows an easy in vivo screening and selection of functional 
interactions between two proteins. This genetic test is based on 
the reconstitution, in an Escherichia colicya strain, of a signal 
transduction pathway that takes advantage of the positive 
control exerted by cAMP. Two putative interacting proteins 
are genetically fused to two complementary fragments, T25 
and T18, that constitute the catalytic domain of BordeteUa 
pertussis adenylate cyclase. Association of the two-hybrid 
proteins results in functional complementation between T2S 
and T18 fragments and leads to cAMP synthesis, Cyclic AMP 
then triggers transcriptional activation of catabolic operons, 
such as lactose or maltose, that yield a characteristic pheno- 
type. In this genetic test, the involvement of a signaling 
cascade ofTers the unique property that association between 
the hybrid proteins can be spatially separated from the 
transcriptional activation readout. This permits a versatile 
design of screening procedures either for ligands that bind to 
a given "bait," as in the classical yeast two-hybrid system, or 
for molecules or mutations that block a given interaction 
between two proteins of interest. 

Most biological processes involve specific protein-protein 
interactions. General methodologies to identify interacting 
proteins or to study these interactions have been developed 
extensively. Among them, the yeast two-hybrid system cur- 
rently represents the most powerful in vivo approach to screen 
for polypeptides that could bind to a given target protein. 
Originally developed by Fields and coworkers (1*2), it utilizes 
hybrid genes to detect protein-protein interactions by means 
of activation of a reporter-gene expression (3, 4). In essence, 
the two putative protein partners are genetically fused to the 
DNA-binding domain of a transcription factor and to a tran- 
scriptional activation domain, respectively. A productive in- 
teraction between the two proteins of interest will bring the 
transcriptional activation domain in the proximity of the 
DNA-binding domain and will trigger the transcription of an 
adjacent reporter gene (usually lacZ or a nutritional marker), 
giving a screenable phenorype. Very recently, Rossi et at. (5) 
described a different approach, a mammalian "two-hybrid" 
system, which uses /3-galactosidase complementation to mon- 
itor protein-protein interactions in intact eukaryptic cells. 

A bacterial equivalent of the two-hybrid system has not yet 
been reported. Phage display (6, 7) and double-tagging assay 
(8) represent alternative approaches to screen complex librar- 
ies of proteins for direct interaction with a given ligand. 
However, these techniques do not allow an in vivo selection of 
the relevant clones. 

We describe here a two-hybrid system in Escherichia coli in 
which the proteins of interest are genetically fused to two 
complementary fragments of the catalytic domain of Borde- 

The publication costs of this article* were defrayed in pari by page charge 
payment. This article must therefore be hereby marked "advertisement" in 
accordance with 18 U.S.C. §1734 solely to indicate this fact. 

O 1998 by The National Academy of Sciences 0027-8424/98/955752-552.00/0 
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telta pertussis adenylate cyclase (9, 10). Interaction between the 
two proteins results in functional complementation between 
the two adenylate cyclase fragments leading to cAMP synthe- 
sis, which, in turn, can trigger the expression of several reside 21 
genes. Using this assay, one can select specific clones express- 
ing a protein that interacts with a given target, by a simple 
genetic screening. 

MATERIALS AND METHODS 

Strain and Growth Media. DHP1 is an adenylate cyclase- 
deficient (cya) derivative of DH1 (F~, glnV44(AS) t recAl 
endAl, gyrA96 (Nat), thil, hsdR17, spoTL rfbDI) (25) and was 
isolated by using phosphomycin as a selection antibiotic (II). 
Growth media used were the rich Luria-Bertant (LB) medium 
or the synthetic medium M63 (12) supplemented with 
carbon source. Antibiotic concentrations were as follows: 100 
Mg/ml ampicillin and 30 Mg/ml chloramphenicol. Screening 
for the ability to ferment sugars was performed either on 
MacConkey agar plates containing \% maltose, or on LB 
plates containing 40 ftg/m\ X-gal (5-bromo-4-chloro-^- 
indoiyl-/3-D-gaIactopyranoside) and 0.5 mM isopropyI-/3-r> 
thiogalactopyranoside. 

Plasmids. The plasmid pCm-AHLl is an expression vector 
for the full catalytic domain of adenylate cyclase. It was 
generated by subcloning a 1.5-kb P\>u\l fragment from 
pACTAC1322 (13) containing the first 384 codons of cya -I 
under the control of the lac UVS promoter [cAMP/catabolite 
gene activator protein (CAP)-independent], into pACYClS^ 
linearized by EcoRV-Hincll. Plasmid pT25 is a derivative of 
pACYC184 that encodes the T25 fragment of CyaA (amino 
acids 1-224) in frame with a multicloning site sequence, under 
the control of the lac UVS promoter. Plasmid pT18 is a 
derivative of pBluescript II KS (Stratagene), compatible with 
pT25, that encodes the T18 fragment of CyaA (amino acids 
225-399) in frame with the multicloning site sequence of 
pBluescript II KS. The leucine zipper region of the yeast 
protein GCN4 was amplified by PCR from plasmid pHB16 
(14) (a gift from H. Bedouelle, Institut Pasteur) using primers- 
lzipl (CTGCAGGTACCTATCCAGCGTATGAAA) and 
lzip2 (TGAGGGTACCCCACGTTCACCCACCAG). The 
amplified sequence was cleaved by Kpn\ and cloned into the 
Kpnl site of pT25 and pTIS to yield plasmids pT25-zip and 
pT18-zip, which encode, respectively, the T25 and T18 frac- 
ments fused in frame with this 35-aa-long leucine zipper. 
Plasmids pT25-Tyr and pT18-Tyr express, respectively, the T25 
and T18 fragments fused in frame with the first 302 aa of the 
dimeric tyrosyl-tRNA synthetase from Bacillus stearother- 
mophilus (15). The corresponding part of the tyrRS gene was 
amplified by PCR from plasmid M13EL (15) (a gift from H. 
Bedo uelle ) by using primers tyrSl (AGAGGTACCGGACAT- 
GGATTTGCT) and tyrS2 (GCCGGTACCGCCGCTGTCAA- 

Abbreviations: LB, Luria-Bertant; X-gal. 5-bromo-4-chloro-3-indolyl- 
0-D-gaJactopyranoside; CAP, catabolite gene activator protein. 
•To whom reprint requests should be addressed at: Unite" de Biociiimie 
. Cellulaire, Institut Paiteur. 28, rue du Docteur Roux, 75724 Paris 
* Cedcx 15, France, e- m ail :'ladant@p as teur.fr. 
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ATTGGC1 cleaved by Kpnl and cloned into the Kpnl site of 
pS and pT18 Theplasmid pT25-prpll that expresses ^the 
^25 fused'to the yeast-splicing factor Prpll (16) was ^on- 
structed as follows. First, we n*^**^^™™ 
of pT25 by inserting the sequence p CCC G GGG .° e J w "" j" e f 
Psil and BamHl site of P T25, to change the reading frame of 
Z BamHl site. Then, a 0.9-kb BamW fragment of piasm.d 
pPL253 (a gift from P. Legrain, Institut Pasteur) encornpassmg 
the prpll gene was subcloned into the corresponding site of 
PT25-2 To construct plasmid P T18-prp21, expressing T18 
Sfed to the yeast splicing factor Prp21 (16). the gene encod,„g 
the Prp21 protein was amplified by PCR from P'«™J pPL182 
(a gift from P. Legrain) by using C^°TACCGA 
TGGAACCAGAAGATAC and CCCGGTACCAGT1 1 1 1 
TACTTTTCTTTAACC, cleaved by Kpnl and cloned into the 

^Ana^c^lMJthods. p-galactosidase assays 
on toluenized bacterial suspensions, as described in Pardee^ 
al. (17). One unit of activity corresponds to 1 nmol ot 
o-nitrophenyl fl-D-galactoside hydrolyzed per mm at 28 C. 

Baaerial extractswere prepared from cells grown overn.ght 
at 30°C in LB medium supplemented with ampicllin and 
chloramphenicol. Pelleted cells were suspended .n 8_ M 
urea/20 mM Hepes-Na. P H 7.5. and disrupted by sorucat.on. 
Adenylate cyclase assays were performed onextrac^ as de- 
scribed previously (9) in the presence of 1 ftM ^"^« du '' n d 
One unit of activity corresponds to 1 nmol of A MP formed 
per min at 37°C. Cyclic AMP measurements were done by an 
EUSA assay. Briefly, a c AMP-biotinyIated-BSA conjugate 
was coated L ELISA plates, and nonspecific prote.n-bind.ng 
^cs were blocked with BSA. Boiled bacter.al cultures were 



Proc. Natl. Acad. Sci. USA 95 (1998) 5753 

then added, followed by diluted rabbit anti-cAMP ant«erum 
in 50 mM Hepes, pH 7.5/150 mM NaCl/0 1% Tween 2U 
(HBST buffer) containing 10 mg/ml BSA After overnight 
n^bation at 4-C, the plates were washed ^^TjJ 
HBST, then goat anti-rabbit IgC i coupled to alkahne phos 
ohatase (API was added and incubated for 1 hr at 30 C. After 
Sng the AP activity was revealed by S'-para-mtrophenyl 
Xspha'te cAMP concentrations were calculated from a 
Sard curve established with known concentrations of 
cAMP diluted in LB medium. 

RESULTS 

Principle. B. pertussis produces a «lmodulin-de pendent 
adenylate cyclase toxin encoded by the eyaA gene (18-20) The 
catalytic domain is located within the first 400 aa of this 
l?706-residue-long protein (10, 19). It exhibits a high catalytic 
activity (kc a , = 2,000 s'<) in the presence of calmodulin (CaM), 
and a low but detectable activity (k„, = 2 s"<) m the absence 
of this activator (9. 21). Biochemical studies revealed that the 
catalytic domain can be proteolytic^ cleaved into two com- 
plementary fragments, T25 and T18, which remain associated 
m the presence of CaM in a fully active ternary complex (9, 10 
22 In the absence of CaM, the mixture of the two fragments 
did not exhibit detectable activity, suggesting that : the : two 
fragments are not able to reassociate to yield basal CaM- 
Spendent activity. We reasoned that when expressed in an 
adenylate cyclase-deficient E. coli strain (E col, lacks CaM or 
CaM-related proteins), the T25 and T18 fragments fused o 
putative interacting proteins would reassociate and lead to 
cAMP synthesis (Fig. 1). 
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Fig. 2. Schematic representation of plasmids. The open boxes 
represent the ORFs of /3-Iactamase (bla) and chloramphenicol acetyl 
transferase (cat) genes. The solid boxes correspond to the ORF of 
cyaA\ with codon numbers indicated below. The hatched boxes 
correspond to the multicloning site sequences (MCS) that are fused at 
the indicated position of the cya ORF. The origin of replication of the 
plasmids is indicated by shaded boxes. 

Functional analysis of B. pertussis adenylate cyclase activity 
can be easily monitored in an E. coli strain deficient in 
endogenous adenylate cyclase. In E. coli, cAMP bound to the 
transcriptional activator, CAP, is a pleiotropic regulator of the 
expression of various genes, including genes involved in the 
catabolism of carbohydrates such as lactose or maltose (23). 
Hence, E. coli strains lacking cAMP are unable to ferment 
lactose or maltose. When the entire catalytic domain of CyaA 
(amino acids 1-399) is expressed in £. coli cya, under the 
transcriptional and translational control of lacZ (plasmid 
pDlA5240), its calmodulin-independent residual activity is 
sufficient to complement an adenylate cyclase-deficient strain 
and to restore its ability to ferment lactose or maltose (24). 
This can be scored either on indicator plates (i.e., LB-X-gal or 
MacConkey media supplemented with maltose) of on selective 

Table l. Analysis of complementation in DHP1 strain 
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media (minimal media supplemented with lactose or maltose 
as unique carbon source). 

Design of a Two-Hybrid System Based on Functional 
Complementation Between T25 and T18 Fragments of CyaA. 

We first constructed two compatible plasmids (derived from 
pACYC184 and pBIuescript II KS) that express either the T25 
fragment corresponding to amino acids 1-224 of CyaA or the 
T18 fragment corresponding to amino acids 225-399. A mul- 
ticloning site was fused to the C-terminal end of T25 to 
facilitate construction of fusions with foreign proteins. Simi- 
larly, the T18 fragment was fused in frame to cdacZ of 
pBIuescript II KS t downstream of its multicloning site (Fig. 2). 

The two plasmids. pT25 and pT18, were cotransformed in 
DHPI, a cya derivative of the E. coli strain DH1 (25), and 
plated on MacConkey agar supplemented with maltose. As 
expected, no spontaneous complementation between the two 
isolated (independently expressed) fragments could be de- 
tected in vivo: all the transformants were white (see Table 1). 
When the DHPI strain was transformed with a plasmid 
expressing the full catalytic domain, all colonies were red 
(Table 1). 

To test whether functional complementation between T25 
and T18 could be brought about by fusing them to interacting 
proteins, we inserted, within the multicloning site of both pT25 
and pT18, a DNA sequence that codes for a 35-aa-Iong leucine 
zipper derived from protein GCN4, a yeast transcriptional 
activator (14). When the resulting plasmids, pT25-zip and 
pT18-zip were cotransformed in DHPI and plated on Mac- 
Conkey/maltose media, the resulting colonies became red 
after 24-30 hr of growth at 30°C (Table 1). 

Control experiments were carried out in which pT25-zip was 
cotransformed with pT18 or pT18-zip was cotransformed with 
pT25. None of the transformants exhibited complementation, 
demonstrating that the functional complementation of T25-zip 
and T18-zip was mediated by the interaction of their leucine 
zipper motif. The efficiency of complementation could be 
further quantified by measuring in liquid cultures either cAMP 
levels or /3-galactosidase activities (Table 1). 

Adenylate cyclase activities of the different transformants 
were measured in ceil extracts in the presence of CaM that 
binds tightly to T25 and T18 fragments to form the active 
adenylate cyclase complex. As shown in Table 1, only the 
extract from DHPl/pT25-zip/pT18-zip exhibited a significant 
enzymatic activity. In the absence of CaM, adenylate cyclase 
activity still could be detected in the extract of DHPl/pT25- 
zip/pT18-zip, indicating that the noncovalent association of 
T25 -zip and T18-zip, mediated by their leucine zipper moiety, 
was able to restore the basal enzymatic activity, which was 
sufficient to sustain in vivo cAMP synthesis (Table 1). The lack 
of adenylate cyclase activity in the extracts of the three other 
types of transformants (Table 1) indicates that at least one of 
the two complementary fragments of adenylate cyclase was 
missing, most probably as a consequence of its in vivo proteo- 
lytic degradation. Therefore, it would appear that the associ- 
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cyclase activity* 

+CaM -CaM 
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pCm-AHLl 


Red/24 hr 


6,650 
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pT25 + pT18 


White/72 hr 
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<1 
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pT25 -r pT18-zip 


White/72 hr 
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<1 


<0.01 


pT25-zip + pT18 
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Dacicrid were gmwn in ld ai ju ^ in mc presence ot io him isopropyl p-D-tmogaiactoside plus appropriate antibiotic 
The results represent the average values obtained for at least Qve independent cultures, which differed by less than 10%. 
*nmol cAMP/min per mg protein; when present in the assays, CaM was at a concentration of 1 ^iM. 
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Table 2. Complementation between various chimeric proteins 
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Plasmids 



Phenotype on 
MacConkey/maltose 



0-Galactosidase, units/ mg 
dry weight bacteria 



cAMP, pmol/mg 
dry weight bacteria 



Red/40 hr 
White/96 hr 
White/96 hr 
White/96 hr 
White/96 hr 

Red/40 hr 



pT25-Tyr + pT18-Tyr 
pT25-Tyr + pT18 
pT25 + pT18-Tyr 
pT25-Tyr + pT18-zip 
pT25-zip + pTl8-Tyr 

hSU* gl-n^LB at 3CTC in ^presence of 0.5 mM isopropy. ^D-lhioga.actoside p.us appropna.e antibiotic, 
TTte results*cpre e sLn,?hc average values obtained for at least five independent cultures. 



2.800 
193 
183 
134 
126 
850 



580 
<10 
<10 
<10 
<10 
65 



ation of T25-zip and T18-zip, through their leucine zipper 
motif, not only resulted in their functional complementation 
but also in their stabilization. Stabilization of protein frag- 
ments (a and u> peptides) through complementation (26) has 
also been observed for 0-galactosidase (A.U., unpublished. 

results). . . . 

Screening for in Vivo Protein-Protein Interactions by Using 
Functional Complementation of T2S and T18. We then exam- 
ined whether the complementation between T25 and 1 18 
could be used to analyze interactions between proteins larger 
than the 35-residue-long leucine zipper motif. A DNA frag- 
ment that encodes the N-terminal part (residues 1-302) of the 
dimeric tyrosyl tRNA synthetase from Bacillus stearother- 
mophilus (15) was subcloned into the multicloning ; site of 
plasmids pT25 and pT18. The resulting plasmids, pT25-TyrRb 
and P T18-TyrRS, when cotransformed in DHP1, yielded red 
transformants on MacConkey/maltose. The transformants 
synthesized cAMP and expressed 0-gaIactosidase (Table 2). 
Control transformations confirmed that the TyrRS moiety was 
responsible for the functional complementation between TJ- 
TyrRS and T18-TyrRS (Table 2). Furthermore, as shown in 
Table 2, no complementation occurred when T25-TyrRS was 
cotransformed with P T18-zip or vice versa. This demonstrates 
that the complementation was dictated by the specificity of 
recognition of the polypeptides fused to the two fragments, 
T25 and T18. It also demonstrates that functional complemen- 
tation and stabilization of the chimeric proteins occur only 
upon specific interactions between the two partners. 

We further showed (Table 2) that the bacterial two-hybrid 
system could detect interaction between the yeast-splicing 
factors Prpll and Prp21 (fused to T25 and T18, respectively) 
that was characterized previously in the yeast two-hybrid assay 
(16). This demonstrates that this bacterial complementation 
assay canreveal association between eukaryotic proteins. 

To mimic a screening procedure we mixed plasmids pT 18- 
zip and P T18-TyrRS with about a 5-fold excess of pT18 and 
cotransformed this mixture in DHP1 with either pT25 or 




Fig. 3. Screening of interacting proteins with the bacterial two- 
hybrid system. DHP1 cells were cotransformed with a mixture ot 
plasmids pTI8. pT18-zip, and pT18-Tyr and either pT25 <>l) or 
pT25-zip (B), plated on LE^X-gal agar plates containing 0.5 mM 
isopropyl -3-D-thiogalactopyranoside, ampicillin, and chloramphem- 
col, and incubated for 30 hr at 3CC Note that the cya* colonies are 
larger than the cya ones. 



nT25-zip The transformants were plated on LB-X-gal. All the 
colonies cotransformed with pT25 were white (Fig. 3). Around 
20% of the colonies were blue when the cells were cotrans- 
formed with the mixture of pT18 derivatives and pT25-zip The 
plasmid DNAs of these clones were further analyzed by 
restriction mapping. As expected, the blue colonies among the 
bacteria cotransformed with pT25-zip harbored only pT18-zip. 
In another series of experiments. pT18-zip was mixed with a 
1 000-fold excess of pT18 and this mixture was transformed in 
DHP1 harboring pT25-zip and plated on MacConkey/maltose. 
Three red colonies were identified among about 3,000 white 
ones Plasmid DNA analysis of the Mai* clones confirmed the 
presence of pT18-zip. Transformation of the same mixture of 
pT18-zip/ P T18 into DHP1 harboring pT25 gave no Ma 
clones of 10,000 analyzed (data not shown). These results 
indicate that the functional complementation between the 
adenylate cyclase fragments could be used to identify inter- 
acting proteins in E. coli. 

Finally we examined whether the complementation be- 
tween T25 and T18 could be used in a selection procedure 
rather than using the screening described above DHP1 bac- 
teria cotransformed with complementing plasmids (pT25-zip/ 
pTlS-zip or P T25-TyrRS/pT18-TyrRS) were able to grow on 
minimal medium supplemented with lactose or maltose as 
unique carbon sources, whereas bacteria cotransformed with 
noncomplementing plasmids ( P T25-zip/pTl8-TyrRS or pT25- 
TyrRS/pT18-zip) did not grow (data not shown). To deter- 
mine whether this selection could be used to identify inter- 
acting proteins among an excess of noninteracttng ones, we 
performed the following "model screening" on selective me- 
dia. DHP1 bacteria harboring pT25-zip and pT18-zip (expect- 
ed phenotype: Lac + ) were mixed with a 10 5 excess of DHP 1/ 
pT25/pT18 (expected phenotype: Lac ), and then 10 cells 
from this mixture were plated on minimal medium supple- 
mented with lactose plus antibiotics. After 4-5 days at 30 C, 
100-^00 Lac + colonies appeared. Plasmid DNA analysis in- 
dicated that 18 of 20 of these colonies tested harbored 
pT25-zip and pT18-zip. When 10 7 DHPl/pT25/pT18 cells 
were plated on minimal medium/lactose, about 10 colonies 
were detected: these cells appeared to represent spontaneous 
revertants of DHP1 to a Lac* phenotype (because of either 
reversion of cya to cya* or to cAMP/CAP-independent lac 
promoter mutations). This "model screening" demonstrates 
that bacteria expressing specific interacting proteins fused to 
the adenylate cyclase fragments could be selected among a 
large number (here a 10 5 -fold excess) of irrelevant cloncs. 

DISCUSSION 

We describe here an E. coli two-hybrid system that allows 
identification of interacting proteins by a simple genetic test 
(screening and/or selection). In our approach, the m vivo 
association of two interacting proteins is coupled to the 
production of a regulatory signaling molecule, cAMP, that in 
turn triggers the expression of specific reporter genes, giving 
rjse to a selectable phenotype. 
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We took advantage of the modular structure of the catalytic 
domain of B. pertussis adenylate cyclase, which is composed of 
two complementary fragments, T25 and T18, that are both 
necessary to form an active enzyme, in the presence of CaM 
(9, 10). As shown here, the two fragments, when expressed in 
E. coli as separate entities, are unable to recognize each other 
and cannot reconstitute a functional enzyme. However, when 
T25 and T18 are fused to peptides or proteins that are able to 
interact, heterodimerization of these chimeric polypeptides 
results in a functional complementation between the adenylate 
cyclase fragments and, therefore, cAMP synthesis. Ultimately, 
cAMP, upon binding to CAP, is able to activate the transcrip- 
tion of catabolic operons, allowing the bacteria to ferment 
carbohydrates such as maltose or lactose. We have demon- 
strated that this bacterial two-hybrid system is able to reveal 
interactions between small peptides (GCN4 leucine zipper), 
bacterial (tyrosyl tRNA synthetase), or eukaryotic proteins 
(yeast Prpll/Prp21 complex). 

Several characteristics of this genetic screening and/or 
selection provide an attractive approach to search for and 
analyze interacting proteins. Because our two-hybrid system 
involves the generation of a regulatory molecule, cAMP, the 
physical association of the two putative interacting proteins, 
can be spatially separated from the transcriptional events 
(activation) that are dependent on cAMP synthesis. This 
means that the protein-protein interaction under study does 
not need to take place in the vicinity of the transcription 
machinery as is the case for the yeast two-hybrid system. 
Hence, it will be possible to analyze protein interactions that 
occur either in the cytosol (as described here) or at the 
inner-membrane level. Moreover, because this genetic screen., 
is, in essence, an assay for proximity of the fused T25 and T18 
fragments, one could anticipate that this system would be 
particularly suitable to analyze colocalization of given proteins 
within muitimolecular assemblies. 

The readout of the complementation between T25 and T18 
fragments is the transcriptional activation of cAMP/CAP- 
dependent genes. In this work, we took advantage of naturally 
occurring cAMP/CAP-dependcnt catabolic genes in E. coli. 
However, it is easy to design specific reporter cassettes in which 
any gene of interest is fused to a cAMP/CAP-dependent 
promoter. We currently are constructing such a system by using 
an antibiotic-resistance gene. This will facilitate the screening 
of complex libraries by a simple selection for antibiotic resis- 
tance. Alternatively, the reporter gene driven by cAMP/CAP 
could encode a toxic product. This could be particularly useful 
to search for chemical compounds or mutations that abolish a 
given interaction between the two studied proteins. This 
bacterial system is, therefore, particularly versatile as it offers 
the possibility of both positive and negative selections. 

That this genetic test is carried out in E. coli greatly 
facilitates the screening as well as the characterization of the 
interacting proteins. First, it is possible to use the same plasmid 
constructs to screen a library to identify a putative binding 
partner to a given "bait," and then to express the hybrid 
proteins to characterize their interaction by in vitro binding 
assays. Second, the high efficiency of transformation that can 
be achieved in E. coli allows the analysis of libraries of high 
complexity. This is particularly useful for (/) the screening of 
peptides from a library made from random DNA sequences 
that present an affinity for a given bait protein and (/<) the 
exhaustive analysis of the network of interactions between the 
proteins of a given organism (27, 28). 
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In essence, our system exploits one of the fundamental 
principles of signal transduction, that is, signal amplification. 
In the system described here we relied only on the basal 
enzymatic activity of B. pertussis adenylate cyclase, i.e., in the 
absence of CaM, which nevertheless is sufficient to synthesize 
enough cAMP to activate the transcription of the lac and mal 
operons (we estimate, from adenylate cyclase activity mea- 
sured in bacterial extracts, that about a thousand of T25 and 
T18 molecules per cell are expressed in the present design). 
This system could be rendered exquisitely sensitive by using the 
full catalytic potency of B. pertussis adenylate cyclase, i.e.. in 
the presence of its natural activator. In this case, it is antici- 
pated that reconstitution of only a few hybrid molecules per 
ceil will be sufficient to elicit a detectable signal. 
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(54) Title: PROTEIN FRAGMENT COMPLEMENTATION ASSAYS TO DETECT BIOMOLECULAR INTERACTIONS 
(57) Abstract 



We describe a strategy for designing and implementing pro- 
tein-fragment complementation assays (PCAs) to detect biomolecular 
interactions in vivo and in vitro. The design, implementation and 
broad applications of this strategy are illustrated with a large number 
of enzymes with particular detail provided for the example of murine 
dihydrofolate reductase (DHFR). Fusion peptides consisting of N and 
C-terminal fragments of murine DHFR fused to GCN4 leucine zip- 
per sequences were coexpressed in Escherichia coli grown in minimal 
medium, where the endogenous DHFR activity was inhibited with 
trimethoprim. Coexpression of the complementary fusion products 
restored colony formation. Survival only occurred when both DHFR 
fragments were present and contained leucine-zipper forming se- 
quences, demonstrating that reconstitution of enzyme activity requires 
assistance of leucine zipper formation. DHFR fragment-interface 
point mutants of increasing severity (He to Val, Ala and Gly) re- 
sulted in a sequential increase in E. coli doubling times illustrating 
the successful DHFR fragment reassembly rather that non-specific in- 
teractions between fragments. This assay could be used to study equi- 
librium and kinetic aspects of molecular interactions including pro- 
tein-protein, protein-DNA, protein-RNA, protein-carbohydrate and 
protein-small molecule interactions, for screening cDNA libraries for 
binding of a target protein with unknown proteins or libraries of small 
organic molecules for biological activity. The selection and design 
criteria applied here is developed for numerous examples of clonal 
selection, colorometric, fluorometric and other assays based on en- 
zymes whose products can be measured. The development of such 
assay systems is shown to be simple, and provides for a diverse set 
of protein fragment complementation applications. 
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TITLE OF THE INVENTION 

PROTEIN FRAGMENT COMPLEMENTATION ASSAYS TO DETECT BIOMOLECULAR INTERACTIONS 



5 FIELD OF THE INVENTIO N 

The present invention relates to the determination of the 
function of novel gene products. The invention further relates to Protein 
fragment Complementation Assays (PCA). PCAs allow for the detection 
of a wide variety of types of protein-protein, protein-RNA, protein-DNA, 
10 Protein-carbohydrate or protein-small organic molecule interactions in 
different cellular contexts appropriate to the study of such interactions. 

BACKGRO UND OF THE INVENTION 

Many processes in biology, including transcription, 
1 5 translation, and metabolic or signal transduction pathways, are mediated 
by noN-covalently-associated multienzyme complexes 1 ' 10 \ The formation 
of multiprotein or protein-nucleic acid complexes produce the most 
efficient chemical machinery. Much of modern biological research is 
concerned with identifying proteins involved in cellular processes, 
determining their functions and how, when, and where they interact with 
other proteins involved in specific pathways. Further, with rapid 
advances in genome sequencing projects there is a need to develop 
strategies to define "protein linkage maps", detailed inventories of protein 
interactions that make up functional assemblies of proteins 2 3 . Despite the 
importance of understanding protein assembly in biological processes, 
there are few convenient methods for studying protein-protein interactions 
in vivo* 5 . Approaches include the use of chemical crosslinking reagents 
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and resonance energy transfer between dye-coupled protons** . A 
powerful and commonly used strategy, the yeast two-hybrid system, » 
used to identify novel protein-protein interactions and to exam.ne the 
amino acid determinants of specific protein interactions- 8 . The approach 
a „ows for rapid screening of a large number of clones, including cDNA 
Hbraries. Limitations of this technique include the fact that the interacts 
must occur in a specific context (the nucleus of S. cerevisiae), and 
generally cannot be used to distinguish induced versus const.tut.ve 

interactions. . 

Recently, a novel strategy for detecting protein-prote.n 

interactions has been demonstrated by Johnsson and Varshavsky 108 
called the ubiquitin-based split protein sensor (USPS)'. The strategy m 
based on cleavage of proteins with A/-terminal fusions to ubiquit.n by 
cytosolic proteases (ubiquitinases) that recognize its tertiary structure. 
The strategy depends on the reassembly of the tertiary structure of the 
protein ubiquitin from complementary N- and C-terminal fragments and 
crucially on the augmention of this reassembly by oligomerizafon 
domains fused to these fragments. Reassembly is detected as specific 
proteolysis of the assembled product by cytosolic proteases 
(ubiquitinases). The authors demonstrated that a fusion of a reporter 
protein-ubiquitin C-termina. fragment could also be cleaved by 
ubiquitinases, but only if co-expressed with an /v-terminal fragment of 
ubiquitin that was complementary to the C-terminal fragment. The 
^constitution of observable ubiquitinase activity only occurred if the N- 
and C-terminal fragments were bound through GCN4 leucine 
zippers 109 - 110 . The authors suggested that this "split-gene" strategy could 
be used as an in vivo assay of protein-protein interactions and analysis 
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of protein assembly kinetics in cells. Unfortunately, this strategy requires 
additional cellular factors (in this case ubiquitinases) and the detection 
method does not lend itself to high-throughput screening of cDNA 
libraries. 

Rossi, F., C. A. Charlton, and H. M. Blau (1997) Proc. 
Nat. Acad. Sci. (USA) 94, 8405-8410) have reported an assay based on 
the classical complementation of a and w fragments of b-galactosidase 
(b-gal) and induction of complementation by induced oligomerization of 
the proteins FKBP12 and the mamalian target of rapamycin by rapamycin 
in transfected C2C12 myoblast cell lines. Reconstitution of b-gal activity 
is detected using substrate fluorescein di-b-D-galactopyranoside using 
several fluorecence detection assays. While this assay bears some 
resemblance to the present invention, there are several significant 
distinguishing differences. First, this particular complementation . 
approach has been used for over thirty years in a vast number of 
applications including the detection of protein-protein interactions: 
Krevolin, M. and D. Kates (1993) U.S. Patent No. 5,362,625) teaches the 
use of this complementation to detect protein-protein interactions. Also 
achievement of b-gal complementation in mamalian cells has previously 
been reported (Moosmann, P. and S. Rusconi (1996) Nucl. Acids Res. 
24, 1171-1172). The individual PCAs presented here are completely de 
novo designed interaction detection assays, not described in any way 
previously except for publications arising from applicants laboratory. 
Secondly, this application describes a general strategy to develop 
molecular interaction assays from a large number of enzyme or protein 
detectors, all de novo designed assays, whereas the b-gal assay is not 
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novel, nor are any general strategies or advancements over previosly well 

documented applications given. 

As in the USPS, the yeast-two hybrid strategy requires 
additional cellular machinery for detection that exist only in specific 
cellular compartments. There is therefore a need for a detection system 
which uses the ^constitution of a specific enzyme activity from fragments 
as the assay itself, without the requirement for other proteins for the 
detection of the activity. Preferably, the assay would involve an 
oligomerization-assisted complementation of fragments of monomeric or 
multimeric enzymes that require no other proteins for the detection of 
their activity. Furthermore, if the structure of an enzyme were known .t 
would be possible to design fragments of the enzyme to ensure that the 
reassembled fragments would be active and to introduce mutations to 
alter the stringency of detection of reassembly. However, knowledge of 
structure is not a prerequesite to the design of complementing fragments, 
as will be explained below. The flexibility allowed in the design of such 
an approach would make it applicable to situations where other detection 

systems may not be suitable. 

Recent advances in human genomics research has 

led to rapid progress in the identification of novel genes. In applications 
to biological and pharmaceutical research, there is now the pressing need 
to determine the functions of novel gene products; for example, for genes 
shown to be involved in disease phenotypes. It is in addressing 
questions of function where genomics-based pharmaceutical research 
becomes bogged down and there is now the need for advances in the 
development of simple and automatable functional assays. A first step 
in defining the function of a novel gene is to determine its interactions with 
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other gene products in an appropriate context; that is, since proteins 
make specific interactions with other proteins or other biopolymers as part 
of functional assemblies, an appropriate way to examine the function of 
a novel gene is to determine its physical relationships with the products 
5 of other genes. 

Screening techniques for protein interactions, such as 
the yeast "two-hybrid" system, have transformed molecular biology, but 
can only be used to study specific types of constitutively interacting 
proteins or interactions of proteins with other molecules, in narrowly 

10 defined cellular and compartmental contexts and require a complex 
cellular machinery (transcription) to work. To rationally screen for protein 
interactions within the context of a specific problem requires more flexible 
approaches. Specifically, assays that meet criteria necessary not only 
to detecting molecular interactions, but also to validating these 

15 interactions as specific and biologically relevant. 

A list of assay characteristics that meet such criteria are 

as follows: 

1) Allow for the detection of protein-protein, protein-DNA/RNA or protein- 
drug interactions in vivo or in vitro: 
20 2) Allow for the detection of these interactions in appropriate contexts, 
such as within a specific organism, cell type, cellular compartment, or 
organelle. 

3) Allow for the detection of induced versus constitutive protein-protein 
interactions (such as by a cell growth or inhibitory factor). 
25 4) To be able to distinguish specific versus non-specific protein-protein 
interactions by controlling the sensitivity of the assay. 
5) Allow for the detection of the kinetics of protein assembly in cells. 
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6) Allow for screening of cDNA, small organic molecule, or DNA or RNA 
libraries for molecular interactions. 

cl imiuiarv OF THE INVENTION 

5 The present invention seeks to provide the above- 

mentioned needs for which the prior art is silent. The present invention 
provides a general strategy for detecting protein interactions with other 
biopolymers including other proteins, nucleic acids, carbohydrates or for 
screening small molecule libraries for compounds of potential therapeutic 
10 value. In a preferred embodiment, the instant invention seeks to provide 
an oligomerization-assisted complementation of fragments of monomeric 
enzymes that require no other proteins for the detection of their activity. 
In one such embodiment, a protein-fragment complementation assay 
(PCA) based on ^constitution of dihydrofolate reductase activity by 
15 complementation of defined fragments of the enzyme in E. coli is 
hereby provided. This assay requires no additional endogenous factors 
for detecting specific protein-protein interactions (i.e. leucine zipper 
interactions) and can be conveniently extended to screening cDNA, 
nucleic acid, small molecule or protein design libraries for molecular 
20 interactions. In addition, the assay can also be adapted for detection of 
protein interactions in any cellular context or compartment and be used 
to distinguish between induced versus constitutive protein interactions in 
both prokaryotic and eukaryotic systems. 

One particular strategy for designing a protein 
25 complementation assay (PCA) is based on using the following 
characteristics: 1) A protein or enzyme that is relatively small and 
monomeric, 2) for which there is a large literature of structural and 
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functional information, 3) for which simple assays exist for the 
reconstitution of the protein or activity of the enzyme, both in vivo and in 
vitro, and4)forwhichoverexpression ineukaryotic and prokaryotic cells 
has been demonstrated. If these criteria are met, the structure of the 
enzyme is used to decide the best position in the polypeptide chain to 
split the gene in two, based on the following criteria: 1) The fragments 
should result in subdomains of continuous polypeptide; that is, the 
resulting fragments will not disrupt the subdomain structure of the protein, 
2) the catalytic and cofactor binding sites should all be contained in one 
fragment, and 3) resulting new N- and C-termini should be on the same 
face of the protein to avoid the need for long peptide linkers and allow for 
studies of orientation-dependence of protein binding. 

It should be understood that the above mentioned 
criteria do not all need to be satisfied for a proper working of the present 
invention. It is an advantage that the enzyme be small, preferably 
between 10-40 kDa. Although monomeric enzymes are preferred, 
multimeric enzymes can also be envisaged as within the scope of the 
present invention. The dimeric protein tyrosinase can be used in the 
instant assay. The information on the structure of the enzyme provides an 
additional advantage in designing the PCA, but is not necessary. Indeed, 
an additional strategy, to develop PCAs is presented, based on a 
combination of exonuclease digestion-generated protein fragements 
followed by directed protein evolution in application to the enzyme 
aminoglycoside kinase. Although the overexpression in prokaryotic cells 
is preferred it is not a necessity. It will be understood to the skilled artisan 
that the enzyme catalytic site (of the chosen enzyme) does not absolutely 
need to be on same molecule. 
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The present application explains the rationale and 
criteria for using a particular enzyme in a PCA. Figure 1 shows a 
general description of a PCA. The gene for a protein or enzyme » 
rationally dissected into two or more fragments. Using molecular biology 
techniques, the chosen fragments are subcloned, and to the 5' ends of 
each proteins that either are known or thought to interact are fused. Co- 
transfection or transformation these DNA constructs into cells is then 
carried out. Reassembly of the probe protein or enzyme from its 
fragments is catalyzed by the binding of the test proteins to each other, 
and reconstitute is observed with some assay. It is crucial to understand 
that these assays will only work if the fused, interacting proteins catalyze 
the reassembly of the enzyme. That is, observation of reconstituted 
enzyme activity must be a measure of the interaction of the fused 
proteins. 

A preferred embodiment of the present invention 
focuses on a PCA based on the enzyme dihydrofolate reductase. 
Expansion of the strategy to include assays in eukaryotic, cells, library 
screening, and a specific application to problems concerning the study of 
integrated biochemical pathways such as signal transduction pathways, 
is presented. Additional assays, including those based on enzymes that 
can act as dominant or recesive drug selection or metabolic salvage 
pathways are disclosed. In addition, PCAs based on enzymes that will 
produce a colored or fluorescent product are also disclosed. The present 
invention teaches how the PCA strategy can be both generalized and 
automated for functional testing of novel genes, screening of natural 
products or compound libraries for pharmacological activity and 
identification of novel gene products that interact with DNA, RNA or 
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carbohydrates are disclosed. It also teaches how the PCA strategy can 
be applied to identifying natural products or small molecules from 
compound libraries of potential therapeutic value that can inhibit or 
activate such molecular interactions and how enzyme substrates and 
small molecule inhibitors of enzymes can be identified. Finally, it teaches 
how the PCA strategy can be used to perform protein engineering 
experiments that could lead to designed enzymes with industrial 
applications or peptides with biological activity. 

Simple strategies to design and implement assays for 
detecting protein interactions in vivo are disclosed herein. We have 
designed complementary fragments of the native mDHFR that, when 
coexpressed in E. coli grown in minimal medium, allow for survival of 
clones expressing the two fragments, where the basal activity of the 
endogenous bacterial DHFR is inhibited by the competitive inhibitor 
trimethoprim (Fig. 3). Reconstitution of activity only occurred when both 
N- and C-terminal fragments of DHFR were coexpressed as C-terminal 
fusions to GCN4 leucine zipper sequences, indicating that reassembly of 
the fragments requires formation of a leucine zipper between the N- and 
C-terminal fusion peptides. The sequential increase in cell doubling 
times resulting from the destabilizing mutations directed at the assembly 
interface (Ile1 14 to Val, Ala or Gly) demonstrates that the observed cell 
survival under selective conditions is a result of the specific, leucine- 
zipper-assisted association of mDHFR fragment^ ,2] with fragment[3], as 
opposed to nonspecific interactions of Z-F[3] with Z-F[1,2]. several 
detailed and many additional examples are given. 

As demonstrated previously with the ubiquitin-based 
split protein sensor (USPS) 9 , a protein-fragment complementation 
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strategy can be used to study equilibrium and kinetic aspects of protein- 
protein interactions in vivo. The DHFR and other PCAs however, are 
simpler assays. They are complete systems; no additional endogenous 
factors are necessary and the results of complementation are observed 
directly, with no further manipulation. The E. coli cell survival assay 
described herein should therefore be particularly useful for screening 
cDNA libraries for protein-protein interactions. mDHFR expression in 
cells can be monitored by binding of fluorescent high-affinity substrate 

analogues for DHFR 26 . 

There are several further aspects of the PCAs that 
distinguish them from all other strategies for studying protein-protein 
interactions in vivo (except USPS). We have designed complementary 
fragments of enzymes that allow for controlling the stringency of the 
assay, and could be used to obtain estimates of the kinetics and 
equilibrium constants for association of two proteins. For example, with 
DHFR the point mutations of the wild-type enzyme He 1 14 to Val, Ala, or 
Gly alter the stringency of reconstitution of DHFR activity. For 
determining estimates of equilibrium and kinetic parameters for a specific 
protein-protein interaction, one could perform a series of DHFR PCA 
experiments with two proteins that interact with a known affinity, using the 
wild type or destabilizing mutant DHFR fragments. Comparison of cell 
growth rates in this model system with rates for a DHFR PCA using 
unknowns would give an estimate of the strength of the unknown 
interaction. 

It should be understood that the present invention 
should not be limited to the DHFR or other PCAs presented, as it is only 
non-limiting embodiments of the protein complementation assay of the 
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present invention. Moreover, the PCAs should not be limited in the 
context in which they could be used. Constructs could be designed for 
targeting the PCA fusions to specific compartments in the cell by addition 
of signaling peptide sequences 27 28 . Induced versus constitutive protein- 
5 protein interactions could be distinguished by a eukaryotic version of the 
PCA, in the case of an interaction that is triggered by a biochemical 
event. Also, the system could be adapted for use in screening for novel, 
induced protein-molecular associations between a target protein and an 
expression library. 

10 The instant invention is also directed to a method for 

detecting biomolecular interactions said method comprising: 

(a) selecting an appropriate reporter molecule; 

(b) effecting fragmentation of said reporter molecule 
such that said fragmentation results in reversible loss of reporter function; 

15 (°) fusing or attaching fragments of said reporter 

molecule separately to other molecules; followed by 

(d) reassociation of said reporter fragments through 
interactions of the molecules that are fused to said fragments. 

The invention also provides molecular fragment 

20 complementation assays for the detection of molecular interactions 
comprising a reassembly of separate fragments of a molecule, wherein 
reassembly of said fragments is operated by the interaction of molecular 
domains fused to each fragment of said molecules, and wherein 
reassembly of the fragments, is independent of other molecular 

25 processes. 

In another aspect, the present invention is directed to a 
method of testing biomolecular interactions comprising: 
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a) generating a first fusion product comprising 

i) a first fragment of a first molecule and 

ii) a second molecule which is different or the 

same as said first molecule; 
5 b) generating a second fusion product comprising 

i) a second fragment of said first molecule; and 

ii) a third molecule which is different from or the 
same as said first molecule or second molecule; 

c) allowing the first and second fusion products to 

10 contact each other; and 

d) testing for activity regained by association of the 
recombined fragments of the first molecule, wherein said reassociation is 
mediated by interaction of the second and third molecules. 

In another novel feature, the invention is directed to a 
15 method comprising an assay where fragments of a first molecule are 
fused to a second molecule and fragment association is detected by 
reconstitution of the first molecule's activity. 

The present invention also provides a composition 
comprising a product selected from the group consisting of: 
20 (a) a first fusion product comprising: 

1) a first fragment of a first molecule whose 
fragments can exhibit a detectable activity when associated and 

2) a second molecule that can bind (a)(1); 
(b) a second fusion product comprising 

25 1 ) a second fragment of said first molecule and 

2) a third molecule that can bind (b)(1 ); and 
c) both (a) and (b). 
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The invention further provides a composition comprising 
complementary fragments of a first molecule, each fused to a separate 
fragment of a second molecule. 

The inventors of the present subject matter further 
5 provide a composition comprising a nucleic acid molecule coding for a 
fusion product, which molecule comprises sequences coding for a 
product selected from the group consisting of: 

(a) a first fusion product comprising: 

1 ) fragments of a first molecule whose fragments 
10 can exhibit a detectable activity when associated and 

2) a second molecule fused to the fragment of the 

first molecule; 

(b) a second fusion product comprising 

1) a second fragment of said first molecule and 
15 2) a second or third molecule; and 

(c) both (a) and (b). 

The present invention is also directed to a method of 
testing for biomolecular interactions associated with: (a) complementary 
fragments of a first molecule whose fragments can exhibit a detectable 
20 activity when associated or (b) binding of two protein-protein interacting 
domains, from a second or third molecule, said method comprising: 

1) creating a fusion of 

(a) a first fragment of a first molecule whose 
fragments can exhibit a detectable activity when associated and 
25 (b) a first protein-protein interacting domain; 

2) creating a fusion of- 

(a) a second fragment of said first molecule and 
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(b) a second protein-protein interacting domain 
that can bind said first protein-protein interacting domain; 

3) allowing the fusions of (1 ) and (2) to contact each 

other; and 

5 4) testing for said activity. 

The instant inevntion further provides a composition 
comprising a product selected from the group consisting of. 

(a) a first fusion product comprising: 

1 ) a first fragment of a molecule whose fragments 
10 can exhibit a detectable activity when associated and 

2) a first protein-protein interacting domain; 

(b) a second fusion product comprising 

1 ) a second fragment of said first molecule and 

2) a second protein-protein interacting domain 
15 that can bind said first protein-protein interacting domain; and 

(c) both (a) and (b). 

The invention is also directed to a composition 
comprising a nucleic acid molecule coding for a fusion product, which 
molecule comprises sequences coding for either. 
20 (a) a first fusion product comprising: 

1 ) a first fragment of a molecule whose fragments 
can exhibit a detectable activity when associated and 

2) a first protein-protein interacting domain; or 
(b) a second fusion product comprising 

25 1) a second fragment of said molecule and 

2) a second protein-protein interacting domain 
that can bind said first protein-protein interacting domain; or 
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(c) both (a) and (b). 

The invention also provides a method of detecting 
kinetics of protein assembly and screening cDNA libraries comprising 
performing PCA. 

In another embodiment, the invention further provides 
a method of testing the ability of a compound to inhibit molecular 
interactions in a PCA comprising performing a PCA in the presence of 
said compound and correlating any inhibition with said presence. 

In a further embodiment,, the invention provides a 
method for detecting protein-protein interactions in living organisms and 
or cells, which method comprises: 

(a) synthesizing probe protein fragments from an 
enzyme which enables dominant selection by dissecting the gene coding 
for the enzyme into at least two fragments; 

(b) constructing fusion proteins with one or more - 
molecules that are to be tested for interactions; 

(c) fusing the proteins obtained in (b) with one or more 
of the probe fragments; 

(d) coexpressing the fusion proteins; and 

(e) detecting the reconstitution of enzyme activity. 
The invention still provides a method for detecting 

biomolecular interactions said method comprising: 

(a) selecting an appropriate reporter molecule; 

(b) effecting fragmentation of said reporter molecule; 

(c) fusing or attaching fragments of said reporter 
molecule separately to other molecules; followed by 
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(d) reassociation of said reporter fragments through 
interactions of the molecules that are fused to said fragments. 

Lastly, the invention also provides a novel method of 
affecting gene therapy, which includes the step of providing the assays 
and compositions described above. 

The present invention is pionneering as it is the first 
protein complementation assay displaying such a level of simplicity and 
versatility. The exemplified embodiments are protein-fragment 
complementation assays (PCA) based on mDHFR, where a leucine 
zipper directs the reconstitution of DHFR activity. Activity was detected 
by an E. coli survival assay which is both practical and inexpensive. This 
system illustrates the use of mDHFR fragment complementation in the 
detection of leucine zipper dimerization and could be applied to the 
detection of unknown, specific protein-molecular interactions in vivo. 

It should be undertstood that the instant invention is not 
limited to the PCAs presented here, as numerous other enzymes can be 
selected and used in accordance with the teachings of the present 
invention. Examples of such markers can be found in Kaufman, (1987 
Genetic Eng. 9:155-198) and references found therein as well as table 1 

20 of this application. 

It should also be clear to the skilled artisan to which the 
present invention pertains that the invention is not limited to the use of 
leucine zippers as the two interacting molecules. Indeed, numerous other 
types of protein-molecule interactions can be used and identified in 

25 accordance with the teaching of the present invention . The known types 
of motifs involved in protein-molecular interactions are well known in the 
art. 
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The present application refers to numerous prior art 
documents and the entire contents of all those prior art documents are 
herein incorporated by reference. 

Other features and advantages of the present invention 
5 will be apparent from the following description of the preferred 
embodiments thereof, the appended Examples and from the enjoined 
claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 FIG. 1 provides a general description of a PCA. Using 

molecular biology techniques, the chosen fragments of the enzyme are 
subcloned, and to the 5 f ends of each, proteins that either are known or 
thought to interact are fused. Co-transfection or transformation these 
DNA constructs into cells is then carried out and reconstitution with some 

15 assay is observed. 

FIG. 2 is a scheme of the fusion constructs used in one 
of the embodiments of the invention. The hexahistidine peptide (6His), 
the homodimerizing GCN4 leucine zipper (Zipper) and mDHFR fragments 
(1, 2 and 3) are illustrated. The labels for the constructs are used to 

20 identify both the DNA constructs and the proteins expressed from these 
constructs. 

FIG. 3: (A) shows E. coli survival assay on minimal 
medium plates. Control: Left side of the plate: E. coli harboring pQE-30 
(no insert); right side; E. coli harboring pQE-16, coding for native mDHFR. 
25 Panel I: Left side of each plate: transformation with construct Z-F[1,2]; 
right side of each plate: transformation with construct Z-F[3]. Panel II: 
Cotransformation with constructs Z-F[1,2] and Z-F[3]. Panel III: 
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Cotransformation with constructs Control-F[1 ,2] and Z-F[3]. All plates 
contain 0.5 mg/ml trimethoprim. In panels I to III, plates on the right side 

contain 1mM IPTG. 

(B) E. coli survival assay using destabilizing DHFR 

5 mutants. Panel I: Cotransformation of E. coli with constructs Z-F[1 ,2] and 
Z-F[3:lle114Val]. Panel II: Cotransformation with Z-F[1,2] and Z- 
F[3:lle1 14Ala]. Inset is a 5-fold enlargement of the right-side plate. Panel 
III: Cotransformation with Z-F[1 ,2] and Z-F[3:lle1 14Gly]. All plates contain 
0.5 mg/ml trimethoprim. Plates on the right side contain 1mM IPTG. 

10 FIG. 4 features the coexpression of mDHFR fragments. 

(A) Agarose gel analysis of restriction pattern resulting from Hindi 
digestion of plasmid DNA. Lane 1 contains DNA isolated from E. coli 
cotransformed with constructs Z-F[1 ,2] and Z-F[3]. Lanes 2 and 3 contain 
DNA isolated from E. coli transformed with, respectively, construct Z-F[3] 

15 and construct Z-F[1,2]. Fragment migration (in bp) is indicated to the 
right. 

(B) SDS-PAGE analysis of mDHFR fragment 
expression. Lanes 1 to 5 show crude lysate of untransformed E. coli 
(lane 1), or E. coli expressing Z-F[1,2] (20.8 kDa; lane 2), Z-F[3] (18.4 

20 kDa; lane 3), Control-F[1 ,2] (14.2 kDa; lane 4), and Z-F[1 ,2] + Z-F[3] (lane 
5). Lane 6 shows 40 ml out of 2ml copurified Z-F[1,2] and Z-F[3]. 
Arrowheads point to the proteins of interest. Migration of molecular 
weight markers (in kDa) is indicated to the right. 

FIG. 5 illustrates the general features of a PCA based 

25 on a survival assay such as the DHFR PCA. The assay can be used in 
a bacterial or a mammalian context. The inserted target DNA can be a 
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known sequence coding for a protein (or protein domain) of interest, or 
can be a cDNA library. 

FIG. 6 represents an autoradiograph of a COS cell 
lysate after a 30 min. 35 S-Met-Cys pulse-labelling. The expression pattern 
5 is essentially identical to that observed in E. coli (see Fig. 4). The DNA 
transfected into the cells (or cotransfected) is indicated above the 
respective lanes. 

FIG. 7 illustrates the results of a protein engineering 
application of the mDHFR bacterial PCA. Two semi-random leucine 

10 zipper libraries were created (as described in the text) and each inserted 
A/-terminal to one of the mDHFR fragments. Cotransformation of the 
resulting zipper-DHFR fragment libraries in E. coli and plating on selective 
medium allowed for survival of clones harboring successfully interacting 
leucine zippers. Fourteen clones were isolated and the zippers were 

15 sequenced to identify the residues at the 6e6 and 6g6 positions. The 
6e-g0 pairs were categorized, as having attractive pairing 
(chargexharge, charge:neutral polar or neutral polar:neutrat polar) or 
repulsive pairing (chargexharge) and the number of each type of 
interaction scored for each clone. The total number of interactions for 

20 each clone is 6; the interactions are tallied on the histogram. 

Other objects, advantages and features of the present 
invention will become more apparent upon reading of the following non- 
restrictive description of preferred embodiments with reference to the 
accompanying drawings which are examplary and should not be 

25 interpreted as limiting the scope of the present invention. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Sftteistion of mDH FR for a PCA 

In designing a protein-fragment complementation assay 
(PCA), we sought to identify an enzyme for which the following is true: 1 ) 

5 An enzyme that is relatively small and monomeric, 2) for which structural 
and functional information exists, 3) for which simple assays exist for both 
in vivo and in vitro measurement, and 4) for which overexpression in 
eukaryotic and prokaryotic cells has been demonstrated. Murine DHFR 
(mDHFR) meets all of the criteria for a PCA listed above. Prokaryotic and 

10 eukaryotic DHFR is central to cellular one-carbon metabolism and is 
absolutely required for cell survival in both prokaryotes and eukaryotes. 
Specifically it catalyses the reduction of dihydrofolate to tetrahydrofolate 
for use in transfer of one-carbon units required for biosynthesis of serine, 
methionine, purines and thymidylate. The DHFRs are small (17 kD to 21 

15 kD), monomeric proteins. The crystal structures of DHFR from various 
bacterial and eukaryotic sources are known and substrate binding sites 
and active site residues have been determined 111 ' 114 , allowing for rational 
design of protein fragments. The folding, catalysis, and kinetics of a 
number of DHFRs have been studied extensively 115 " 119 . The enzyme 

20 activity can be monitored in vitro by a simple spectrophotometry assay 120 , 
or in vivo by cell survival in cells grown in the absence of DHFR end 
products. DHFR is specifically inhibited by the anti-folate drug 
trimethoprim. As mammalian DHFR has a 12000-fold lower affinity for 
trimethoprim than does bacterial DHFR 121 , growth of bacteria expressing 

25 mDHFR in the presence of trimethoprim levels lethal to bacteria is an 
efficient means of selecting for reassembly of mDHFR fragments into 
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active enzyme. High level expression of mDHFR has been demonstrated 
in transformed prokaryote or transfected eukaryotic cells 122 " 126 . 

Design Considerations. 

5 mDHFR shares high sequence identity with the human 

DHFR (hDHFR) sequence (91% identity) and is highly homologous to the 
E. colt enzyme (29% identity, 68% homology) and these sequences share 
visually superimposable tertiary structure 111 . Comparison of the crystal 
structures of mDHFR and hDHFR suggests that their active sites are 

10 essentially identical 127,128 . DHFR has been described as being formed of 
three structural fragments forming two domains 129 130 the adenine binding 
domain (residues 47 to 105 = fragment[2]) and a discontinuous domain 
(residues 1 to 46 = fragment[1] and 106 to 186 [3]; numbering according 
to the murine sequence). The folate binding pocket and the NADPH 

15 binding groove are formed mainly by residues belonging to fragments[1]- 
and [2]. Fragment [3] is not directly implicated in catalysis. 

Residues 101 to 108 of hDHFR, at the junction between 
fragment[2] and fragment[3], form a disordered loop which lies on the 
same face of the protein as both termini. We chose to cleave mDHFR 

20 between fragments [1,2] and [3], at residue 107, so as to cause minimal 
disruption of the active site and NADPH cofactor binding sites. The 
native A/- terminus of mDHFR and the novel A/-terminus created by 
cleavage occur on the same surface of the enzyme 112 ' 128 allowing for ease 
of A/-terminal covalent attachment of each fragment to associating 

25 fragments such as the leucine zippers used in this study. Using this 
system, we have obtained leucine-zipper assisted assembly of the 
mDHFR fragments into active enzyme. 
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EXAMPLE 1 
EXPERIMENTAL PROTOCOL 

DNA Constructs 

5 Mutagenic and sequencing oligonucleotides were 

purchased from Gibco BRL. Restriction endonucleases and DNA 
modifying enzymes were from Pharmacia and New England Biolabs. The 
mDHFR fragments carrying their own iN-frame stop codon were 
subcloned into pQE-32 (Qiagen), downstream from and iN-frame with the 

10 hexahistidine peptide and a GCN4 leucine zipper (Fig 1 ; Fig. 2). All final 
constructs were based on the Qiagen pQE series of vectors, which 
contain an inducible promoter-operator element (tac), a consensus 
ribosomal binding site, initiator codon and nucleotides coding for a 
hexahistidine peptide. Full-length mDHFR is expressed from pQE-16 

15 (Qiagen). 



Expression vector harboring the GCN4 leuc ine zipper 

Residues 235 to 281 of the GCN4 leucine zipper (a 
Sall/BamHI 254 bp fragment) were obtained from a yeast expression 

20 plasmid pRS316 9 . The recessed terminus at the BamHI site was filled-in 
with Klenow polymerase and the fragment was ligated to pQE-32 
linearized with Sall/Hindlll(filled-in). The product, construct Z, carries an 
open reading frame coding for the sequence Met-Arg-Gly-Ser followed by 
a hexahistidine tag and 1 3 residues preceding the GCN4 leucine zipper 

25 residues. 
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Creation of DHFR fragments: 

The eukaryotic transient expression vector, pMT3 
(derived from pMT2) 16 , was used as a template for PCR-generation of 
mDHFR containing the features allowing subcloning and separate 
5 expression of fragment[1 ,2] and fragment[3]. The megaprimer method of 
PCR mutagenesis 29 was used to generate a full-length 590 bp product. 
Oligonucleotides complementary to the nucleotide sequence coding for 
the N- and C-termini of mDHFR and containing a novel BspEI site outside 
the coding sequence were used as well as an oligonucleotide used to 
10 create a novel stop codon afterfragment[1,2], followed by a novel Spel 
site for use in subcloning fragment[3]. 

Construction of a new multiple cloning region and subcloning of 
DHFR fragments [1.2D and [3] 

15 Complementary oligonucleotides containing the novel 

restriction sites: SnaBI, Nhel, Spel and BspEI, were hybridized together 
resulting in 5* and 3' overhangs complementary to EcoRI, and inserted 
into pMT3 at a unique EcoRI site. The 590 bp PCR product (described 
above) was digested with BspEI and inserted into pMT3 linearized at 

20 BspEI, yielding construct [1,2,3]. The 610 bp BspEI/EcoNI fragment 
(coding for DHFR fragments ,2], followed by a novel stop and fragment[3] 
up to EcoNI) was filled in at EcoNI and subcloned into pMT3 opened with 
BspEI/Hpal, yielding construct F[1,2]. The 250 bp Spel/BspEI fragment 
of construct [1,2,3] coding for DHFR fragment[3] (with no in-frame stop 

25 codon) was subcloned into pMT3 opened with the same enzymes. The 
stop codon of the wild-type DHFR sequence, downstream from 
fragment[3] in pMT3, was inserted as follows. Cleavage with EcoNI, 
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present in both the inserted fragment[3] and the wild-type fragment[3], 
removal of the 683 bp intervening sequence and religation of the vector 
yielded a construct of fragment[3] with the wild-type stop codon, construct 
F[3]. 

5 

Creation of the expression c onstructs 

The 1051 bp and the 958 bp SnaBI/Xbal fragments of 
constructs F[1,2] and F[3], respectively, were subcloned into construct Z 
opened with Bglll(filled-in)/Nhel, yielding constructs Z-F[1,2] and Z-F[3] 
10 (Fig. 2). For the Control expression construct, the 180 bp Xmal/BspEI 
fragment coding for the zipper was removed from construct Z-F[1,2], 
yielding construct Control-F[1 ,2] (Fig. 2). 

Creation of Stability Mutants 

15 Site-directed mutagenesis was performed 30 to produce 

mutants at Ile1 14 (numbering of the wild-type mDHFR). The mutagenesis 
reaction was carried out on the Kpnl/BamHI fragment of construct Z-F[3] 
subcloned into pBluescript SK+ (Stratagene), using oligonucleotides that 
encode a silent mutation producing a novel BamHI site. The 206 bp 

20 Nhel/EcoNI fragment of putative mutants identified by restriction was 
subcloned back into Z-F[3]. The mutations were confirmed by DNA 
sequencing. 

E. co// Surv ival Assay 

25 E. coli strain BL21 carrying plasmid pRep4 (from 

Qiagen, for constitutive expression of the lac repressor) were made 
competent, transformed with the appropriate DNA constructs and washed 
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twice with minimal medium before plating on minimal medium plates 
containing 50 mg/ml kanamycin, 100 mg/ml ampicillin and 0.5 mg/ml 
trimethoprim. One half of each transformation mixture was plated in the . 
absence, and the second half in the presence, of 1 mM IPTG. All plates 
5 were placed at 37°C for 66 hrs. 

E. coli Growth Curves 

Colonies obtained from cotransformation were 
propagated and used to inoculate 10 ml of minimal medium 

10 supplemented with ampicillin, kanamycin as well as IPTG (1mM) and 
trimethoprim (1 mg/ml) where indicated. Cotransformants of Z-F[1,2] + 
Z-F[3:lle1 14Gly] were obtained under non-selective conditions by plating 
the transformation mixture on L-agar (+ kanamycin and ampicillin) and 
screening for the presence of the two constructs by restriction analysis. 

15 All growth curves were performed in triplicate. Aliquots were withdrawn 
periodically for measurement of optical density. Doubling time was 
calculated for early logarithmic growth (OD 600 between 0.02 and 0.2). 

Protein Overexpression and Purification 

20 Bacteria were propagated in Terrific Broth 31 in the 

presence of the appropriate antibiotics to an OD600 of approximately 1 .0. 
Expression was induced by addition of 1 mM IPTG and further incubation 
for 3 hrs. For analysis of crude extract, pellets from 150 ml of induced 
cells were lysed by boiling in loading dye. The lysates were clarified by 

25 microcentrifugation and analyzed by SDS-PAGE32. For protein 
purification, a cell pellet from 50 ml of induced E. coli cotransformed with 
constructs Z-F[1,2] and Z-F[3] was lysed by sonication, and a denaturing 
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purification of the insoluble pellet undertaken using Ni-NTA (Qiagen) as 
described by the manufacturer. The proteins were eluted with a stepwise 
imidazole gradient. The fractions were analyzed by SDS-PAGE. 

RESULTS 

Desi gn of mDHFR fra gments for a PCA 

mDHFR shares high sequence identity with the human 
DHFR (hDHFR) sequence. As the coordinates of the murine crystal 
structure were not available, we based our design considerations on the 
hDHFR structure. DHFR has been described as comprising three 
structural fragments forming two domains: the adenine binding domain 
(F[2]) and a discontinuous domain (F[1] and F[3]) 13 18 . The folate binding 
pocket and the NADPH binding groove are formed mainly by residues 
belonging to F[1] and F[2]. Residues 101 to 108 of hDHFR form a 
disordered loop which lies on the same face of the protein as both termini. 
This loop occurs at the junction between F[2] and F[3]. By cleaving 
mDHFR at residue 107. we created F[1 ,2] and F[3], thus causing minimal 
disruption of the active site and substrate binding sites. The native N- 
terminus of mDHFR and the novel A/-terminus created by cleavage were 
covalently attached to the C-termini of GCN4 leucine zippers (Fig. 1). 



g. co/i Survival Assays 

Figure 2 illustrates the general features of the expressed 
constructs and the nomenclature used in this study. Figure 3 (panel A) 
illustrates the results of cotransformation of bacteria with constructs 
coding for Z-F[1,2] and Z-F[3], in the presence of trimethoprim, clearly 
showing that colony growth under selective pressure is possible only in 
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cells expressing both fragments of mDHFR. There is no growth in the 
presence of either Z-F[1,2] or Z-F[3] alone. Induction of protein 
expression with IPTG is essential for colony growth (Fig. 3A). The 
presence of the leucine zipper on both fragments of mDHFR is essential 
5 as illustrated by cotransformation of bacteria with both vectors coding for 
mDHFR fragments, only one of which carries a leucine zipper (Fig. 3A). 
It should be noted that growth of control E. coli transformed with the full- 
length mDHFR is possible in the absence of IPTG due to low levels of 
expression in uninduced cells. 

10 Confirmation of the presence of both plasmids in 

bacteria able to grow with trimethoprim was obtained from restriction 
analysis of the plasmid DNA purified from isolated colonies. Figure 4 (A) 
reveals the presence of the 1200 bp Hindi restriction fragment from 
construct Z-F[1,2] as well as the 487 and 599 bp Hindi restriction 

15 fragments from construct Z-F[3]. Also present is the 935 bp Hindi* 
fragment of pRep4. Overexpression of the fusion proteins is illustrated 
in Figure 4 (B). In all cases, overexpression of a protein of the expected 
molecular weight is apparent on SDS-PAGE of the crude lysate. 
Purification of the coexpressed proteins under denaturing conditions 

20 yielded two bands of apparent homogeneity upon analysis by Coomassie- 
stained SDS-PAGE (Fig. 4B). 

Stability Mutants 

Applicants generated mutants of F[3] to test whether 
25 reconstitution of mDHFR activity by fragment assembly was specific. 
Protein stability can be reduced by changing the side-chain volume in the 
hydrophobic core of a protein 9 , 22 ~ 25 . Residue Ile1 14 of mDHFR occurs in 
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a core b-strand at the interface between F[1 ,2] and F[3], isolated from the 
active site, lie 1 14 is in van der Waals contact with Ile51 and Leu93 in 
F[1,2] 11 . We mutated lie 114 to Val, Ala, or Gly. Figure 3 (panel B) 
illustrates the results of cotransformation of E. coli with construct Z-F[1,2] 
5 and the mutated Z-F[3] constructs. The colonies obtained from 
cotransformation with Z-FI3:lle114Ala] grew more slowly than those 
cotransformed with Z-F[3] or Z-F[3:lle1 14Val] (see inset to Fig. 3B). No 
colony growth was detected in cells cotransformed with Z-F[3.lle1 14Gly]. 
The number of transformants obtained was not significantly different in 
I o the case where colonies were observed, implying that cells cotransformed 
with Z-F[1 ,2] and either Z-F[3], Z-F[3:lle1 14Vai] or Z-F[3:lle1 14Ala] have 
an equal survival rate. Overexpression of the mutants Z-F[3:lle1 14X] was 
in the same range as Z-F[3], as determined by Coomassie-stained SDS- 
PAGE (data not shown). 
1 5 We also compared the relative efficiency of reassembly 

of mDHFR fragments by measuring the doubling time of the 
cotransformants in liquid medium. Doubling time in minimal medium was 
constant for all transformants (data not shown). Selective pressure by 
trimethoprim in the absence of IPTG prevented growth of E. coli except 
20 when transformed with pQE-16 coding for full-length DHFR due to low 
levels of expression in uninduced cells. Induction of mDHFR fragment 
expression with IPTG allowed survival of cotransformed cells (except in 
the case of Z-F[1,2] + Z-F[3:lle1 14Gly], although the doubling times were 
significantly increased relative to growth in the absence of trimethoprim. 
25 The doubling time measured for cells expressing Z-F[1 ,2] + Z-F[3], Z- 
F[1,2] + Z-F[3:lle114Val] and Z-F[1,2] + Z-F[3:lle114Ala] were 1.6-fold, 
1 .9-fold and 4.1 -fold, higher respectively, than the doubling time of E. coli 
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expressing pQE-16 in the absence of trimethoprim and IPTG. The 
presence of IPTG unexpectedly prevented growth of E. coli transformed 
with full-length mDHFR. Growth was partially restored by addition of the 
folate metabolism end-products thymine, adenine, pantothenate, glycine 
5 and methionine (data not shown). This suggests that induced 
overexpression of mDHFR was lethal to E. coli when grown in minimal 
medium as a result of depletion of the folate pool by binding to the 
enzyme. 

In another embodiment, applicants make point 

10 mutations in the GCN4 leucine zipper of Z-F[1,2] and Z-F[3], for which 
direct equilibrium and kinetic parameters are known and correlating these 
known values with parameters derived from the PCA (Pelletier and 
Michnick, in preparation). Comparison of cell growth rates in this model 
system with rates for a DHFR PCA using unknowns would give an 

15 estimate of the strength of the unknown interaction. This should enable 
the determination of estimates of equilibrium and kinetic parameters for 
a specific protein-protein interaction. 

The present invention has illustrated and demonstrated 
a protein-fragment complementation assay (PCA) based on mDHFR, 

20 where a leucine zipper directs the reconstitution of DHFR activity. Activity 
was detected by an E. coli survival assay which is both practical and 
inexpensive. This system illustrates the use of mDHFR fragment 
complementation in the detection of leucine zipper dimerization and could 
be applied to the detection of unknown, specific protein-protein 

25 interactions in vivo. 

E. coli Aminoglycoside kinase: Optimization and Design of a PCA 
using an Exonuclease-Molecular Evolution Strategy 
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Although applicants have demonstrated that the 
engineering/design strategy described above can be used to produce 
complementary enzyme fragments, it is obvious that proteins did not 
evolve in such a way that such fragments would be expected to have 
5 optimal physical characteristics, including solubility, foldability (fast 
folding), protease resistance, or enzymatic activity. An alternative 
embodiment to the engineering/design strategy is the 
endonuclease/evolution approach. This strategy can be used by itself or 
in conjunction with the engineering/design strategy. The advantages of 
10 this approach are that in principle, prior knowledge of the protein 
strucuture is not necessary, that the optimal fragments are chosen for 
PCA and that these fragments will also have optimal characteristics. 
Following selection of optimal complementary fragments, the fragments 
are exposed to multiple rounds of random mutagenesis. Mutagenesis is 
15 acheived by suboptimal PCR combined with chemical mutagenesis or 
DNA shuffling (Stemmer, W. P. C. (1994) Proc, Natl, Acad, Sci. USA 91, 
10747-10751). The overall strategy is described for the case of 
aminoglycoside kinase (AK), an example of antibiotic resistance marker 
that can be used for dominant selection of prokaryotic cells such as E. 
20 coli or eukaryotic cells such as yeast or mammalian cell lines. The 
structure of an AK is already known, and so strategy (1) would be 
possible, however we chose to combine both strategy (1) as defined for 
DHFR above, in conjunction with strategy (2). 
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EXPERIMENTAL PROTOCOL 

The optimization/selection procedure is as follows: 

Generation of of library of AK fragments based on products of 
5 Exonuclease digestion 

Nested sets of deletions are created at the 5' and the 3' 
ends of the AK gene. In order to create unidirectional deletions, unique 
restriction sites are introduced in the regions flanking the AK gene. At the 
5' and 3' termini, an "outer" sticky site with a protruding 3' terminus (Sph 

10 I and Kpn I, respectively) and an "inner" sticky site with recessed 3' 
terminus (Bgl II and Sal I, respectively) are added by PCR. Cleavage at 
Sph I and Bgl II (or Kpn I and Sal I) results in creation of a protruding 
terminus leading back to the flanking sequence and a recessed terminus 
leading into the AK gene. Digestion with E. coli exonuclease III and S1 

15 nuclease (Henikoff, S. (1987) Methods in Enzymology 155, 156-165) 
yields a set of nested deletions from the recessed terminus only. Thus, 
10 mg of DNA is digested with Sph I and Bgl II (or Kpn I and Sal I), 
phenol-chloroform extracted, and 12.5 U exonuclease III added. At 30 
sec intervals over 10 min, aliquots are taken and put into solution with 2 

20 U S1 nuclease. The newly created ends are filled in with T4 DNA 
polymerase (0.1 U per sample) and the set of vectors closed back by 
blunt-ended ligation (10 U ligase per sample). The average length of the 
deletion at each time point is determined by restriction analysis of the 
sets. This yields sets of AK genes deleted from the 5' or the 3' termini. 

25 This manipulation is undertaken directly in the pQE-32-Zipper constructs, 
such that the products can be used directly in activity screening. 
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Screening for AK activity 

As a first step in determining the requirements for 
fragment complementation, we must determine the minimum /V-terminal 
and C-terminal fragments of AK that, alone, are active. Sets of deletions 

5 are individually transformed into E. coli BL21 cells and expression of the 
AK fragments is induced by IPTG. The sets where a significant number 
of colonies appear in the presence of G418 serve to indicate the 
approximate length of N- and C-terminal AK fragments which retain 
activity. Fragment complementation must therefore be undertaken with 

10 fragments taken from within these limits. The zipper-directed fragment 
complementation is detected as follows: appropriate sets of deletions, or 
pools of sets, are cotransformed into BL21, expression is induced with 
IPTG and growth in the presence of varying G418 concentrations is 
monitored. Large colonies which grow in the presence of high G418 

1 5 concentrations are selected as giving the most efficiently complementing 
products. 

Directed evolution of optimal AK fraaements using " DNA shuffling 

After optimal fragments have been selected, the 
20 individual fragments are removed by restriction digestion at Sph I and 
Kpn I allowing for 5' and 3' constant priming regions flanking the N- or C- 
terminal complementary fragments of AK. These oligonucleotides (2-4 
mg) are digested with DNasel (0.005 units/ul, 100 ul) and fragments of 
1 0-50 nucleotides are extracted from low melting point agarose. PCR is 
25 then performed with the fragmented DNA, using Taq polymerase (2.5 
units/ul) in a PCR mixture containing 0.2 mM dNTPs, 2.2 mM Mg2Cl (or 
0 mM for subuptimal PCR), 50 mM KCI, 10 mM Tris.HCI, pH 9.0, 0.1% 
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TritonX-100. A PCR program of 94C/60 sec; 94C 30 sec; 55C 30 sec; 
72C 30 sec times 30 to 50; 72C 5 min. Samples are taken every 5 
cycles after 25 cycles to monitor the appearance of reassembled 
complete fragments on agarose gel. The primerless PCR product is then 
5 diluted 1:40 or 1:60 and used as template for PCR with 5', 3' 
complementary constant region oligos as primers for a further 20 cycles. 
Final product is restriction digested with Sph I and Kpn I and the products 
subcloned back into pQE32-Zipper to yield the final library of expression 
plasmids. As before, E. coli BL21 cells are sequentially transformed with 

10 C-terminal or /V-terminal complementary fragment-expression vectors at 
an estimated efficiency of 109 and finally cells cotransformed with the 
complementary fragment. E. coli are grown on agarose plates containing 
1 mg/ ml G418 and after 16 hours the largest colonies are selected and 
grown in liquid medium at increasing concentrations of G418. Those 

15 clones showing the maximal resistance to G418 are then selected and if 
maximum resistance or greater is reached the evolution is terminated. 
Otherwise the DNA shuffling proceedure is repeated. Finally, optimal 
fragments are sequenced and physical properties and enzymatic activity 
are assessed. This optimized AK PCA is now ready to test for dominant 

20 selection in any other cell type including yeast and mammalian cell lines. 
This strategy can be used to develop any PCA based on enzymes that 
impart dominant or recessive selection to a drug or toxin or to enzymes 
that produce a colored or fluorescent product. In the later two cases the 
end point of the evolution process is at minimum, reatainment of signal 

25 for the intact, wild type enzyme or enhancement of the signal. This 
strategy can also be used in the absense of knowledge of the enzyme 
structure, whether the enzyme in mono-, di- or multimeric structure. 
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However, knowledge of the enzyme structure does not preclude applying 
this strategy as well, as described below. 

As can be appreciated, knowledge of the enzyme 
structure can be used to render a more efficient way of using molecular 

5 evolution to design a PCA. In this case, the enzyme structure is used to 
define minimal domains of the protein in question, as was done for DHFR. 
Instead of generating fragments of completely random length for the N- 
or C-terminal fragments, we select, during the exonuclease phase, those 
fragments that at a minimum will code for one of the two domains. For 

1 o instance, in the case of AK, two well defined domains can be discerned 
in the structure consisting of residues 1 -94 in the AMerminus and residues 
95-267 in the C-terminus. Endonuclease digestions are performed as 
above, but reaction products are selected that will minimally code for one 
of the two domains. These are then the starting points for fragment 

15 selection and evolution cycles as described above. 

Heteromeric pnyyme PCA 

A further embodiment of the invention relates to PCA 
based on using heterodimeric or heteromultimeric enzymes in which the 
20 entire catalytic machinery is contained within one independently folding 
subunit and the other subunit provides stability and/or a cofactor to the 
enzymatic subunit. In this embodiment of PCA, the regulatory subunit is 
split into complementary fragments and fused to interacting proteins. 
These fragments are co-transformed/transfected into cells along with the 
25 enzyme subunit. As with single enzyme PCA described for DHFR and 
AK, ^constitution and detection of enzyme activity is dependment on 
oligomerization domaiN-assisted reassembly of the regulatory subunit 
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reassembly into its native topology. However, the reconstituted subunit 
then interacts with the intact enzymatic subunit to produce activity. This 
approach is reminiscent of the USPS system, except it has the advantage 
that the enzyme in this case is not a constitutive cellular enzyme, but 
5 rather an exogenous gene product. As such there is no problem with 
background activity from the host cell, the enzyme can be expressed at 
higher levels than a natural gene and can also be modified to be directed 
to specific subcellular compartments (by subcloning compartment-specific 
signal peptides onto the A/- or C-termini of the enzyme and subunit. 

10 fragments). The specific advantage of this approach is that while the 
single enzyme strategy may lead to suboptimal enzymatic activity, in this 
approach, the enzyme folds independently and may in fact act as a 
chaperone to the fragmented regulatory subunit, aiding in its refolding. 
In addition, folding of the fragments may need not be complete in order 

15 to impart regulation of the enzyme. This approach is realized by a 
colorimetric/fluorometric assay we have developed based on the 
Streptomyces tyrosinase. This enzyme catalyzes the conversion of 
tyrosine to deoxyphenylaianine (DOPA). The reaction can be measured 
by conversion of fluorocinyl-tyrosine to the DOPA form. The active 

20 enzyme consists of two subunits, the catalytic domain (Melc2) and a 
copper binding domain (Meld). Meld is a small protein of 14 kD that is 
absolutely required for Melc2 activity. In the assay we are developing, the 
Meld protein is split into two fragments that serve as the 
complementation part of the PCA. These fragments, fused to 

25 oligomerization domains, are coexpressed with Melc2, and the basis of 
the assay is that Melc2 activity is dependent on complementation of the 
Meld fragments. Stoichiometries of protein complexes can also be 
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addressed (i.e. whether a complex consists of two or three proteins) as 
follows. One fuses two proteins to the two Meld fragments and a third to 
intact Melc2. It thus can be shown that the minimum complementary 
active complex of the tyrosinase will require that all three components 
and therefore a trimer is necessary. A key aspect of this approach is that 
we can easily demonstrate specific interactions by making one 
component, specifically the protein-Melc2 fusions catalytic subunit 
dependent on the other components by underexpressing it in the 
background of overexpressed Meld fragment-protein fusions. 



Mnltimer Disru ptt""-Based PCA 

Although applicants have described only fragment complementation of 
intact proteins, protein domains or subunits as comprising PCA, an 
alternate enmbodiments relates to PCAs based on the disruption of the 

15 interface between, for instance a dimeric enzyme that requires stable 
association of the subunits for catalytic activity. In such cases, selective 
or random mutagenesis at the subunit interface would disrupt the 
interaction and the basis of the assay would be that oligomerization 
domains fused to the subunits would provide the nessesary binding 

20 energy to bring the subunits together into a functional enzyme. 

Vecto r Design in App lication to PCAs 

The PCA strategies listed thus far have used two- 
plasmid transformation strategies for expression of complementary 
25 fragments. This approach has some advantages, such as using different 
drug resistance markers to select for optimal incorporation of genes, for 
instance in transformed or transfected cells or for optimum transformation 
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of complementary plasmids into bacteria and control of expression levels 
of PCA fragements using different promoters. However, single plasmid 
strategies have advantages in terms of simplicity of transfection/ 
transformation. Protein expression levels can be controlled in different 
5 ways, while drug selection can be achieved in one of two ways: In the 
case of PCAs based on survival assay using enzymes that are drug 
resistance markers themselves, such as AK, or where the enzyme 
complements a metabolic pathway, such as DHFR, no additional drug 
resistance genes need be incorporated in the expression plasmids. If 

10 however the PCA is based on an enzyme that produces a colored or 
fluorescent product, such as tyrosinase or firefly luciferase, an additional 
drug resistance gene must be expressed from the plasmid. Expression 
of PCA complementary fragments and fused cDNA libraries/target genes 
can be assembled on single plasmids as individual operons under the 

15 control of separate inducible or constitutive promotors, or can be- 
expressed polycistronically. In E. coli polycistronic expression can be 
achieved using known intercoding region sequences, for instance we use 
the region in the mel operon from which we derived the tyrosinase meld- 
melc2 genes which we have shown to be expressed at high levels in E. 

20 coli under the control of a strong (tac) promoter. Genes could also be 
expressed and induced off of independent promoters, such as tac and 
arabinose. For mammalian expression systems, single plasmid systems 
can be used for both transient or stable cell line expression and for 
constitutive or inducible expression. Further, differential control of the 

15 expression of one of the complementary fragment fusions, usually the 
bait-fused fragment, can be controlled to minimize expression. This will 
be important in reducing background non-specific interactions. Examples 
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of differential control of complementary fragment expression include the 
following strategies: 

i) In polycistronic expression, transient or stable, expression of the 
second gene will necessarily be less efficient and so this in itself could 
5 serve to limit the quantity of one of the complementary fragments. 
Alternatively, the first gene product can be limited in expression by 
mutation of an upstream donor/splice site, while the second gene can be 
put under the control of a retroviral internal initiation site, such as that of 
ECMV to enhance expression. 
1 o ii) Individual complementary fragment-fusion pairs can also be put under 
the control of inducible promoters, all comercially available including 
those based on Tet-responsive PhCMVM promoter, and/or steroid 
receptor response elements. In such a system the two complementary 
fragment genes can be turned on and expression levels controlled by 
1 5 dose dependent expression with the inducer, in these cases tetracycline 
and steroid hormones. 

EXAMPLE 2 

20 Ap plications nf the PCA s trategy to detect novel gene products in 
hin r .hftmical Pathways anrf to map such pathways 

Among the greatest advantage of PCA over other 
molecular interaction screening methods is that they are designed to be 
performed both in vivo and in any type of cell. This feature is crucial if the 
25 goal of applying a technique is to identify novel interactions from libraries 
and simultaneously be able to determine if the interactions observed are 
biologically relevant. The detailed example given below, and other 
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examples at the end of this section illustrate how it is that validation of 
interactions with PCA is possible. In essence, this is achieved as follows. 
In biochemical pathways, such as hormone receptor-mediated signaling, 
a cascade of enzyme-mediated chemical reactions are triggered by some 
5 molecular event, such as by hormone binding to its membrane surface 
receptor. Enzyme interactions with protein substrates and protein-protein 
or protein-nucleic acid interactions with enzyme-modified substrates then 
occur. Such biochemical signaling cascades only occur in specific cell 
types and model cell lines for studying these processes. Therefore, to 

10 detect induced interactions, such as with known proteins in a pathway 
with yet unidentified proteins, one obviously needs to perform such 
screening in appropriate model cell lines and in the correct cellular 
compartment. Only the PCA strategy can be used in a general way to do 
this. Protein-molecular interaction techniques such as yeast two- or 

15 three-hybrid techniques cannot be performed in a context where such 
events occur, except in the limiting case of nuclear interaction in yeast or 
interactions that are not triggered. There do exist mammalian two-hybrid 
techniques where it might be possible to detect induced protein 
interactions, but only again if the proteins involved can be simultaneously 

20 activated, transported to the nucleus and interact with their partners. 
PCAs do not have these limitation since they do not require additional 
cellular machinery available only in specific compartments. A further 
point is that by performing the PCA strategy in appropriate model cell 
types, it is also possible to introduce appropriate positive and negative 

25 controls for studying a particular pathway. For instance, for a hormone 
signaling pathway it is likely that hormone signaling agonists and 
antagonists or dominant-negative mutants of signaling cascade proteins 
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would be known, that are upstream or act in parallel to the events being 
examined in the PCA. These reagents could be used to determine if 
novel interactions detected by the PCA are biologically relevant. In 
general then, interactions that are detected only if hormone is introduced 
5 but are not seen if an antagonist is simultaneously introduced could be 
hypothesized to represent interactions relevant to the process under 
study. 

Below is a detailed description of an application of the 
DHFR that illustrates these points, as well as further examples where the 
10 PCA strategy could be used. 

Ap plication of the DHFR PCA to Mapp ing Growth Factor-Mediated 
Si gnal Transduction Pathways 

One of the earliest detectable events in growth factor- 
15 activated cell proliferation is the serine phosphorylation of the S6 protein 
of the 40S ribosomal subunit. The discovery of serine/threonine kinases 
that specifically phosphorylate S6 have considerably aided in identifying 
novel mitogen mediated signal transduction pathways. The 
serine/threonine kinase p70S6k has been identified as a specific S6 
20 phosphorylase 131 " 136 . p70S6k is activated by serine and threonine 
phosphorylation at specific sites in response to several mitogenic signals 
including serum in serum starved cells, growth factors including insulin 
and PDGF, and by mitogens such as phorbol esters. Considerable effort 
has been made over the last five years to determine how p70/p85S6k are 
25 activated in response to mitogens. Two receptor-mediated pathways have 
been implicated in p70S6k activation, one associated with the 
phosphatidylinositol-3-kinase (Pl(3)k) and the other with the Pl(3)k 



BNSDOCID: <WO 98341 20 A1> 



WO 98/34120 



PCT/CA98/00068 



41 



homologue mTOR 137 144 . Key to understanding of this proposal, is the fact 
that the role of these enzymes in activation of p70S6k was determined by 
effects of two natural products on phosphorylation and enzyme activity: 
rapamycin, which indirectly inhibits mTOR activity, and wortmannin, which 
5 directly inhibits Pl(3)k activity. It is also important to note that no direct 
upstream kinases or other regulatory proteins of p70S6k have been 
identified to this date. 

The interactions of p70S6k with its known substrate S6 
can be studied as a test system for the DHFR PCA in E. coli and in 

10 mammalian cell lines. One can also seek to identify novel interactions 
with this enzyme that would lead to new insights into how this important 
enzyme is regulated. Also, since activation of the enzyme is mediated by 
multiple pathways that can be selectively inhibited with specific drugs, this 
is an ideal system to test PCAs as methods to distinguish induced versus 

15 constitutive protein-protein interactions. 

a) Testing of the E. coli survival assav: Interaction of p70S6k with S6 

This test is ideal, because the apparent Km (= 250 nM) 
of p70S6k for S6 protein 145 is approximately the same as the Kd for 

20 leucine zipper-forming peptides from GCN4 used in our test system. 
However, we will have to use a constitutively active form of the enzyme 
for our tests. An A/-terminal truncated form of the enzyme D77-p70S6k, 
is constitutively active and will be used in these studies147. 
Methodology: D77-p70S6k-F[1, 2] fusion and D77-p70S6k-F[3] fusion, or 

25 F[1,2] and D77-p70S6k-F[3] fusion (as a control) will be cotransformed 
into E. coli and the cells grown in minimal medium in the presence of 
trimethoprim. Colonies will be selected and expanded for analysis of 
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kinase activity against 40S ribosomal subunits, and for coexpression of 
the two proteins. 

h ) Modification of the bacterial s urvival assay for library screening: 
5 Identification nf Nove l Interacting Proteins 

Screening an expression library for interactions with a 
given target (p70S6k-D77, in this case) will be straightforward in this 
system, given that the only steps involved are: 1 -construction of the 
fusion-expression library as a fusion with mDHFR fragment^]; 2- 
10 transformation of the library in E. coli BL21 harboring P Rep4 (for 
constitutive expression of the lac repressor; this is required in the case 
where a protein product is toxic to the cells) and a plasmid coding for the 
fusion: p70S6k-D77-[1,2]; 3-plating on minimal medium in the presence 
of trimethoprim and IPTG; 4-selection of any colonies that grow, 
15 propagation and isolation of plasmid DNA, followed by sequencing of 
DNA inserts; 5-purifi cation of unknown fusion products via the hexaHis- 
tag and sizing on SDS-PAGE. 

Methodology: 

20 The overall strategy is illustrated in Figure 5. 1- 

Construction of a directional fusioN-expression library: i-cDNA production: 
One can isolate poly(A)+ RNA from BA/F3 cells (B-lymphoid cells) 
because these cells have successfully been used in the study of the 
rapamycin-sensitive p70S6k activation cascade 139 . To enrich for full- 

25 length mRNA, we will affinity purify the mRNA via the 5' cap structure by 
the CAPture method 148 . Reverse transcription will be primed by a "Linker 
Primer": it has a poly(T) tail to prime from the poly(A) mRNA tail, and an 
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Xhol site for later use in directional subcloning of the fragments. The first 
strand is then methylated. After second strand synthesis and blunting of 
the products, "EcoRI Adapters" are added, pruducing digestion of the 
linkers with EcoRI and Xhol (the inserts are protected by methylation) 
5 produces full-length cDNA ready for directional insertion in a vector 
opened with EcoRI and Xhol. Because the success of library screening 
depends largely on the quality of the cDNA produced, we will use the 
above methods as they have proven to consistently produce high-quality 
cDNA libraries, ii-lnsertion of the cDNA into vectors: The library will be 

10 constructed as a C-terminal fusion to mDHFR F[3] in vector pQE-32 
(Qiagen), as we have obtained high levels of expression of mDHFR 
fusions from this vector in BL21 cells. Three such vectors will be created, 
differing at their 3' end, which is the novel polycloning site that we 
engineered (described earlier, under Methods), carrying either 0, 1, or 2 

15 additional nucleotides. This allows read-through from F[3] into the library 
fragments in all 3 translational reading frames. The cDNA fragments will 
be directionally inserted at the EcoRI and Xhol sites in all three vectors 
at once. 2, 3, 4, and 5- These steps have been described earlier, under 
Results, apart from the final sequencing of clones identified using 

20 sequencing primers specific to vector sequences flanking sites of library 
insertion. The protein purification will also be as described earlier, by a 
one-step purification on Ni-NTA (Qiagen). If the product size is more than 
15 kDa over the molecular weight of the DHFR component (equal to a 
cDNA insert of more than 450bp), we will have the inserts sequenced at 

25 the Sheldon Biotechnology Center (McGill University). 
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r ) Development of the Eu karyotic Assay 

The transformation of the system described above, is 
useful to produce an equivalent assay for use in eukaryotic cells. The 
basic principle of the assay is the same: the fragments of mDHFR are 

5 fused to associating domains, and domain association is detected by 
reconstitution of DHFR activity in eukaryotic cells (Figure 5). 
Creation of the expression constructs: The DNA fragments coding for 
the GCN4-zipper-mDHFR fragment fusions were inserted as one piece 
into pMT3, a eukaryotic transient expression vector 126 . Expression of the 

10 fusion proteins in COS cells was apparent on SDS-PAGE after 35[S]Met 
labeling. 

Survival assays in eukaryotic cells: Two systems can be used for 
detection of mDHFR reassembly, in parallel: i- CHO-DUKX B11 cells 
(Chinese Hamster Ovary cell line deficient in DHFR activity) are 

1 5 cotransfected with GCN4-zipper-mDHFR fragment fusions. The cells are 
grown in the absence of nucleotides; only cells carrying reconstituted 
DHFR will undergo normal cell division and colony formation, ii- 
Methotrexate (MTX)-resistant mutants of mDHFR have been created, with 
the goal of transfecting cells that have constitutive DHFR activity such as 

20 COS and 293 cells. We mutated F[1 ,2] in order to incorporate, one at a 
time, each of five mutations that significantly increase Ki (MTX): 
Gly15Trp, Leu22Phe, Leu22Arg, Phe31Ser and Phe34Ser (numbering 
according to the wild-type mDHFR sequence). These mutations occur at 
varying positions relative to the active site and relative to F[3], and have 

25 varying effects on Km (DHF), Km (NADPH) and Vmax of the full-length 
mammalian enzymes in which they were. Mutants Z-F[1,2: Leu22Phe], 
Z-F[1,2: Leu22Arg] and Z-F[1,2: Phe31Ser] all allowed for bacterial 
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survival with high growth rates when cotransformed with Z-F[3] (results 
not shown).The five mutants will be tested in eukaryotic cells, in 
reconstitution of mDHFR fragments to produce enzyme that can sustain 
COS or 293 cell growth while under the selective pressure of MTX, which 
5 will eliminate background due to activity of the native enzyme. The 
mutations offers an advantage in selection while presenting no apparent 
disadvantage with respect to reassembly of active enzyme. If the 
reconstituted mDHFR produced in either of the survival assays allows 
eukaryotic cell growth that is significantly slower than growth with the wild- 
1 0 type enzyme, thymidylate will be added to the growth medium to partially 
relieve the selective pressure offered by the lack of nucleotides, 
d) Testin g of the eukaryotic survival assay 

It is necessary at the outset to test whether induced 
interactions with p70S6k can be detected. One can use the same test 
1 5 system as that for the E. coli test system described above: Induction of- 
association of p70S6k with S6 protein. 
Methodology : 

mDHFR Leu22Phe mutant S6-F[1,2] and p70S6k-F[3], or F[1,2] and 
p70S6k-F[3] (as a control) will be cotransfected into COS cells and the 
cells will be serum starved for 48 hours followed by replating of cells at 
low density in serum and MTX. Colonies will be selected and expanded 
for analysis of kinase activity against 40S ribosomal subunits, and for 
coexpression of the two proteins. Further controls will be performed for 
inhibition of protein association with wortmannin and rapamycin. 
25 e) Modification of the eukaryotic survival a S sav for library sr^nin^ 

An important part of the work required in creating a 
library for use in eukaryotic cells will have been accomplished already, as 



20 
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the EcoRi/Xhol directional cDNA produced by the Stratagene "cDNA 
Synthesis Kit" can directly be inserted directionally into the Stratagene 
Zap Express vector. 
Methodology: 

Steps 1 through 5 are parallel to those for the bacterial 
library screening (above). 1-Again, the library is constructed as a C- 
terminal fusion to mDHFR F[3]. F[3] (with no stop codon) will be inserted 
in frame in Zap Express, followed by insertion of the novel polylinkers 
allowing expression of the inserts in all three reading frames (described 
above), and by the EcoRI/Xhol directional cDNA. This bacteriophage 
library will be propagated and treated with the Stratagene helper phage 
to excise a eukaryotic expression phagemid vector (pBK-CMV) carrying 
the fusion inserts. 2-Cotransfection of the library and p70S6k-F[1 ,2] 
constructs in eukaryotic cells: we will perform the screening in COS or 
293 cells, as these are responsive to serum in activating the P 70S6k 
signaling pathway. Selection experiments will be performed as described 
for the S6 test system above. 3-Propagation, isolation and sequencing 
of the insert DNA will be undertaken. 4-The cloned fusion proteins will be 
sized on SDS-PAGE by direct visualization after 35S-Met/Cys labeling, 
or by Western blotting using a commercial polyclonal antibody to mDHFR. 
Generalization of the Strategy: The scheme for detecting partners for 
the protein P 70S6k can be applied to studies of any biochemical pathway 
in any living organism. Such pathways may also be related to disease 
processes. The disease-related pathway may be an intrinsic process of 
cells in humans where a pathology arises from, for instance mutation, 
deletion or under or over expression of a gene. Alternatively the 
biochemical pathway may be one that is specific to a pathogenic 
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organism or the mechanism of host invasion. In this case, component 
proteins of such processes may be targets of a therapeutic strategy, such 
as development of drugs that inhibit invasion by the organism or a 
component enzyme in a biochemical pathway specific to the pathogenic 
5 organism. 

Inflamatory diseases are a case in point that can 
concern both examples. The protein-protein interactions that mediate the 
adhesion of leukocytes to inflamed tissues are known to involve such 
proteins as vascular cell adhesion molecule-1 (VCAM-1), and certain 

10 cytokines such as IL-6 and IL-8 that are produced during inflammation. 
However, many of the proteins involved in onset of inflammatory 
response remain unknown; further, the intracellular signaling pathways 
triggered by the extracellular associations are poorly understood. The 
PCAs could be used in elucidation of the mechanisms underlying the 

15 onset of inflammation, as well the ensuing signaling. For example, 
signaling pathways associated with inflamation, such as those mediated 
by IL-1, IL-6, IL-8 and tumor necrosis have been studied in some detail 
and many direct and downstream regulators are known. These 
regulators can be used as starting point targets in a PCA screening to 

20 identify other signalling or modulating proteins that could also be targets 
for drug development. 

There is an increased risk of infection by enteric 
pathogens in the occurrence of the intestinal inflammation that 
characterizes idiopathic intestinal diseases. There are two mechanisms 

15 which need to be better understood here and which can be addressed by 
PCA: 

i- the cellular mechanisms of inflammation as described above, and 
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ii-the discovery of the specific cell-surface ligands which the pathogenic 
organisms recognize and associate with. Secreted proteins produced by 
the pathogen can bind to the basolateral membrane of epithelial cells (as 
in the case in Yersinia pseudotuberculosis infection) or be translocated 
into intestinal epithelial cells (Salmonella infection), promoting infectivity 
and/or physiological responses to the infection. However, in most cases 
the interactions between the pathogenic protein and the epithelial cells 
are unknown. 

Cell adhesion and nervous system regeneration A related example in 
cell adhesion includes processes involved in develoment and 
regeneration in the nervous system. Cadherens are membrane proteins 
that mediates calcium dependent cell-cell adhesion. To do so they need 
another class of cytoplasmic proteins called cathenins. Those make a 
bridge between cadherins and cytoskeleton. Cathenins are also regulate 
1 5 genes that control differentiation-specific genes. For instance, the protein 
B-cathenin can interact in certain situation with a transcription factor (lef- 
1) and be translocated into the nucleus where it constrains the number of 
genes transactivated by lef-1 (differentiation). This process is regulated 
by the Wnt signaling pathway (homologs to the wingless pathway in 
20 drosophila) by inactivation of GSK3B which permit degradation after of 
APC (a cytoplasmic adapter protein). PCA strategies could be used to 
identify novel proteins involved in the regulation of these processes. 

Proteins involved in viral integration processes are 
examples of targets that could be tested for inhibitors using the PCA 
25 strategies. Examples for the HIV virus include: 
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i) inhibition of integrase or the transport of the pre-integration complex: 
protein Ma or vpr. 

ii) Inhibition of the cell cycle in G2 by vpr (interaction by cyclin B) causing 
induction of apoptosis. 

5 iii) Inhibition of the interaction of gp160 (precursor of the membrane 
proteins) with furine. 

Accessory proteins of HIV as a therapeutic target: 

i) Vpr: nuclear localizing sequence (target): interaction site of vpr with 
10 phosphatasesA . 

ii) vif: interaction with vimentin (cytoskeleton associated protein). 

ii) Vpu: Degradation of CD4 in the RE mediated by the cytoplasmic tail of 
Vpu. 

iii) nef: Myristoylation signal of Nef. 

15 

EXAMPLE 3 

Other Examples of Protein Fragment Complementation Assays 

Other examples of assays are herein examplified. The 
reason to produce these assays is to provide alternative PCA strategies 
20 that would be appropriate for specific protein association problems such 
as studying equilibrium or kinetic aspects of assembly. Also, it is possible 
that in certain contexts (for example, specific cell types) or for certain 
applications, a specific PCA will not work but an alternative one will. 
Further below are brief descriptions of each other PCAs embodiments. 
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1) Glutathione-S-Transferase (GST) GST from the flat worm 
Schistosoma japonicum is a small (28 kD), monomeric, soluble protein 
that can be expressed in both prokaryotic and eukaryotic cells. A high 
resolution crystal structure has been solved and serves as a starting point 

5 for design of a PCA. A simple and inexpensive colorimetric assay for 
GST activity has been developed consisting of the reductive conjugation 
of reduced glutathione with 1 -chloro-2,4-dinitrobenzine (CNDB), a brilliant 
yellow product. We have designed a PCA based on similar structural 
criteria used to develop the DHFR PCA using GCN4 leucine zippers as 

10 oligomerization domains. Cotransformants of zipper-GST-fragment 
fusions are expressed in E. coli on agar plates and colonies are 
transferred to nitrocellulose paper. Detection of fragment 
complementation is detected in an assay where a glutathione-CDNB 
reaction mixture is applied as an aerosol on the nitrocellulose and 

15 colonies expressing co-expressed fragments of GST are detected as 
yellow images. 

2) Green Fluorescent Protein (GFP) GFP from Aequorea victoria is 
becoming one of the most popular protein markers for gene expression. 

20 This is because the small, monomeric 238 amino-acids protein is 
intrinsically fluorescent due to the presence of an internal chromophore 
that results from the autocatalytic cyclization of the polypeptide backbone 
between residues Ser65 and Gly67 and oxidation of the - bond of Tyr66. 
The GFP chromophore absorbs light optimally at 395 nm and possesses 

25 also a second absorption maximum at 470 nm. This bi-specific absorption 
suggests the existence of two low energy conformers of the chromophore 
whose relative population depends on local environment of the 
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chromophore. A mutant Ser65Thr that eliminates isomerization (single 
absorption maximum at 488 nm) results in a 4 to 6 times more intense 
fluorescence than the wild type. Recently the structure of GFP has been 
solved by two groups, making it now a candidate for a strucutre-based 
5 PCA-design, which we have begun to develop. As with the GST assay, 
we are doing all of our initial development in E. coli with GCN4 leucine 
zipper-forming sequences as oligomerization domains. Direct detection 
of fluorescence by visual observation under broad spectrum UV light will 
be used. We will also test this system in COS cells, selecting for co- 
10 transfectants using fluorescence activated cell sorting (FACS). 

3) Fire Fly Luciferase. Firefly luciferase is a 62 kDa protein which 
catalyzes oxidation of the heterocycle luciferin. The product posesses 
one of the highest quantum yields for bioluminescent reactions: one 

15 photon is emitted for every oxidized luciferin molecule. The structure of 
luciferase has recently been solved, allowing for strucutre-based 
development of a PCA. As with our GST assay, cells are grown on a 
nitrocellulose matrix. The addition of the luciferin at the surface of the 
nitrocellulose permits it to diffuse across the cytoplasmic membranes and 

20 trigger the photoluminescent reaction. The detection is done immediately 
on a photographic film. Luciferase is an ideal candidate for a PCA: the 
detection assays are rapid, inexpensive, very sensitive, and utilizes non- 
radioactive substrate that is available commercially. The substrate of 
luciferase, luciferin, can diffuse across the cytoplasmic membrane (under 

25 acidic pH), allowing the detection of luciferase In intact cells. This 
enzyme is currently utilized as a reporter gene in a variety of expression 
systems. The expression of this protein has been well characterized in 
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bacterial, mammalian, and in plant cells, suggesting that it would provide 
a versatile PCA. 

4) Xanthine-guanine phosphoribosyl transferase (XGPRT) The E. coli 
5 enzyme XGPRT converts xanthine to xanthine monophosphate (XMP), 
a precursor of GMP. Because the mammalian enzyme hypoxanthine- 
guanine phosphoribosyl transferase HGPRT can only use hypoxanthine 
and guanine as substrates, the bacterial XGPRT can be used as a 
dominant selection assay for a PCA for cells grown in the presense of 
10 xanthine. Vectors expressing XGPRT confer the ability of mammalian 
cells to grow in selective medium containing adenine, xanthine, and 
mycophenolic acid. The function of mycophenolic acid is to inhibit de 
novo synthesis of GMP by blocking the conversion of IMP into XMP 
(Chapman A. B., (1983) Molec. & Cellul. Biol. 3, 1421-1429). The only 
15 GMP produced then come from the conversion of xanthine into XMP, 
catalyzed by the bacterial XGPRT. As with aminoglycoside 
phosphotransferase fragments of XGPRT can be generated based on the 
known structure (See table 1.) using the design-evolution strategy 
described above with fragments fused to the GCN4 leucine zippers as a 
20 test oligomerization domains. The complementary fusions are 
cotransfected and the proteins transiently expressed in COS-7 cells, or 
stability expressed in CHO cells, grown in the selective medium. In the 
case of CHO cells, colonies are collected and sequentially re-cultured at 
increasing concentrations of the selective compounds in order to enrich 
25 for populations of cells that efficiently express the fusions at high 
concentrations. 
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5) Adenosine deaminase Adenosine deaminase (ADA) is present in 
minute quantities in virtually all mammalian cell. Although it is not an 
essential enzyme for cell growth, ADA can be used in a dominant 
selection assay. It is possible to establish growth conditions in which the 
5 cells require ADA to survive. ADA catalyzes the irreversible conversion 
of cytotoxic adenine nucleosides to their respective nontoxic inosine 
analogues. By adding cytotoxic concentrations of adenosine or cytotoxic 
adenosine analogues such as 9-b-D-xylofuranosyladenine to the cells, 
ADA is required for cell growth to detoxify the cytotoxic agent. Cells that- 
10 incorporate the ADA gene can then be selected for amplification in the 
presence of low concentrations of 20-deoxycoformycin, a tight-binding 
transition state analogue inhibitor of ADA. ADA can then be used for a 
PCA based on cell survival (Kaufman, R. J. et al. (1986) Proc. of the Nat. 
Acad. Sci. (USA) 83, 3136-3140). As with the other systems described 
15 above, fragments of ADA can be generated based on the known structure . 
(See table 1 .) using the design-evolution strategy described above with 
fragments fused to the GCN4 leucine zippers as a test oligomerization 
domains. The complementary fusions are cotransfected and the proteins 
transiently expressed in COS-7 cells, or stability expressed in CHO cells, 
20 grown in the selective medium containing 20-deoxycoformycin. In the 
case of CHO cells, colonies are collected and sequentially re-cultured at 
increasing concentrations of 20-deoxycoformycin in order to enrich for 
populations of cells that efficiently express the fusions at high 
concentrations 

25 

6) Bleomycin binding protein (zeocin resistance gene) Zeocin, a 
member of the bleomycin/phleomycin family of antibiotics, is toxic to 
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bacteria, fungi, plants, and mammalian cells. The expression of the 
zeocin resistance gene confers resistance to bleomycin/zeocin. The 
protein confers resistance by binding to and sequestering the drug and 
thus preventing its association and hydrolysis of DNA. Berdy, J. (1980) 
5 In Amino Acid and Peptide Antibiotics, J. Berdy, ed. (Boca Raton, FL: 
CRC Press), pp.459-497; Mulsant, P., Tiraby, G., Kallerhoff, J., and 
Perret, J. (1989 Somat. Cell. Mol. Genet. 14, 243-252). Bleomycin binding 
protein (BBP) could then be used for a PCA based on cell survival. As 
with the other systems described above, fragments of ADA can be 
1 o generated based on the known structure (See table 1 .) using the design- 
evolution strategy described above with fragments fused to the GCN4 
leucine zippers as a test oligomerization domains. The BBP is a small (8 
kD) dimer that binds to drugs via a subunit interface binding site. For this 
reason, the design would be somewhat different in that first, a single 
1 5 chain form of the dimer would be generated by making a fusion of two 
BBP genes with a short sequence coding for a simple polypeptide linker 
introduced between the two subunits. Fragments in this case will be 
based on a short sequence of one of the subunit modules, while the other 
fragment will be composed of the remaining sequence of the subunit plus 
20 the other subunit. Complementation and selection experiments will be 
performed as described for the examples above using bleomycin or 
zeocin as selective drugs. 

7) Hygromycin-B-phosphotransferase The antibiotic hygromycin-B is 
25 an aminocyclitol that inhibits protein synthesis by disrupting translocation 
and promoting misreading. The E. coli enzyme hygromycin-B- 
phosphotransferase detoxifies the cells by phosphorylating Hygromycin- 
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B. When expressed in mammalian cells, hygromycin-B- 
phosphotransferase can confer resistance to hygromycin-B ( Gritz, L., 
and Davies, J. (1983) Gene 25, 179-188.). The enzyme is a dominant 
selectable marker and could be used for a PCA based on cell survival. 
5 While the structure of the enzyme is not known it is suspected that this 
enzyme is homologous to aminoglycoside kinase (Shaw, et al. (1993) 
Microbiol. Rev. 57, 138-163). It is therefore possible to use the combined 
design/evolution strategy to produce a PCA with this enzyme and perform 
dominant selection in mammalian cells with selection at increasing 
10 concentrations of hygromycin B. 

8) L-histidinol NAD+oxydoreductase The hisD gene of Salmonella 
typhimurium codes for the L-histidinol NAD+oxydoreductase that 
converts histidinol to histidine. Mammalian cells grown in media lacking 

15 histidine but containing histidinol can be selected for expression of hisD 
(Hartman, S. C, R. C. Mulligan (1988) Proc. of the Nat. Acad. Sci. (USA) 
85, 8047-8051). An additional advantage of using hisD in dominant 
selection is that histidinol is itself toxic, inhibiting the activity of 
endogenous histidyl-tRNA synthetase. Histidinol is also inexpensive and 

20 readily permeates cells. The structure of histidinol NAD+oxydoreductase 
is unknown and so development of a PCA based on this enzyme is based 
entirely on the exonuclease fragment/evolution strategy. 

The following Table list alternative embodiments using other PCA 
reporters. Abreviations in Table: Type: D, dominant selection marker; R, 

!5 recessive selection marker. Structure: four letter codes= Protein Data 
Bank (PDB) entries; K, known but not deposited in PDB; U, unknown, 
mono/oligo: M, monomer; D, dimer; tetra, tetramer. 
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TABLE 1. A list of Other Potential PCA Reporter Candidates 
A-Assays based on Dominant or Recessive Selection 



5 



10 



15 



20 



25 



30 





Enzyme 


Type i 


Structure 


I mono/ 
Slze oligo 


1 Selection 
druqs/Conditions 


DHFR 


i R/D r 


Tiany 


!8kD M 


methotrexate/trimetho 
1 prim 


Adenosine deaminase 


[5/R 


1ADD 


|M 


Xyl-A or 

adenosine.alanosine, 












and 2'- 

1 deoxycoformycin 
| gangcyclovene, HAT 


| Thymidine kinase 

Mutant hypoxanthine- guanine 
I phosphoribosyl transferase 


D/R 
I D 


1KIN 
1 HCjM 


P 
P 


HAT + thymidine 
kinase 


| Thymidylate synthetase 


R 


1NJE 


35kd M 


2 fluorodeoxyuridine 


Xanthine-guanine 
I phosphoribosyl transferase 


P 


1NUL 


mycophenolic acid 
with limiting xanthine 


I Glutamine synthetase 
Asparagine synthetase 


R 
R 


2LGS 
U 


B-aspartyl 
hydroxamate or 
albizin 


I Puromycin A/- 
I acetyltransferase 


P 


U 


23kD M 


puromycin 


Aminoglycoside 
phosphotransferase 


P 


K 


35kD M 


1 neomycin, G418, 
gentamycin 


I Hygromycin B 
nhnsnhotransferase 


P 


U 


M 


I hygromycin B 




L-histidinol:NAD+ 
nvidoreductase 


P 


U 


46kD M 


I histidinol 


I Ripnmvcin bindinq protein D 


K 


8kD p 


I bleomycin/zeocin 




Cytosine methyl-transferase 


I R/D 


U 


5-Azacytidine (5-aza- 
CR) and 5-aza-2*- 
deoxycytidine 




06-alkylguanine 
alkyttransferase 


P 


1ADN 


j N-methyl-A/- 
nitrosourea 




Glycinamide ribonucleotide 
transformylase 


R 


1GRC 


23.2 p 
kD 


dideazatetrahydrofolat 
| e, minus purine 




Glycinamide ribonucleotide 
synthetase 


R 


U 


45.9 
kD 1 


I minus purine 
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Enzyme 


Type 


Structure 


Size 


mono/ 
oligo 


Selection 
drugs/Conditions 


Phosphoribosyl- 
arninoirniuazoie synmetase 


R 


U 


36.7 
kD 




minus purine 


Formylglycinamide ribotide 
amidotransferase 


R 


u 


141. 
4kD 


M 


L-azaserine, 6-diazo- 
5-oxo-L-norleucine, 
minus purine 


Phosphoribosyi- 
aminoimidazole carboxylase 


R 


u 


39.5 
kD 


D 


minus purine 


Phosphoribosyl- 
aminoimidazole carboxamide 
formyltransferase 


R 


u 


57.3 
kD 




minus purine 


Fatty acid synthase 


R 




272k 
D 


D 


ceruienin 


IMP dehydrogenase 


R 


1AK5 


55.4 
kD 


Tetra 


mycophenolic acid 



15 ii-Viral Plaque Assays 



Enzyme 


Type 


Structure 


Size 


Mono/ 
Oligo 


Selection 
drugs/Conditions 


Thioredoxin 


D 


1TDF 


34.5kD 


D 




Reverse transcriptase 


D 


3HVT 








Viral protease 


D 











B-Cell Death Assays 
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30 



Enzyme 


Type 


Structure 


Size 


Mono/ 
Oligo 


Selection 
drugs/Conditions 


Cysteine protease: papain 


D 


1STF 


38.9kD 


M 


inhibited by cystatin 


Cysteine protease: 
caspase 


D 


1CP3 


17kD + 
12kD 


Hetero 
D 


inhibited by DEVD-aldehyde 
(can also by used in a 
fluorimetric or coiorimetric 
assay, in vitro) 


Metailoprotease: 
carboxypeptidase 


D 




47.1kD 


M 


inhibited by methyl-ethyl 
succinic acid 


Serine protease: 
proteinase K 


D 


1PTK 


30.6kD 


M 


inhibited by serpins 
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Enzyme 


Type 


Structure 


Size 


Mono/ 
Oligo 


Selection 
drugs/Conditions 


Aspartic protease, pepsin 




1PSN 


34.5kD 


M 


inhibited by pepstatin A (can 
also be used in an 
fiuorimetric assay, in vitro) 


Lysozyme 


D 


many 


23.2kD 


M 


inntDiiea uy in- 

acetylglucosamine 

trisaccharide 


RNAse 


D 


many 


13.3kD 


M 


inhibited by RNAse inhibitor 


DNAse 


D 


1DNK 


61.6kD 


M 


inhibited bv actin 


Phospholipase A2 


D 


1P2P 


13.8kD 


M/D 


many inhibitors: 
bromophenacyi bromide, 
hexadecyl-trifluoroethyl- 
glycero-phosphomethanol, 
bromoenol lactone, etc. 


Phospholipase C 


D 


1AH7 


28kD 


M 


many inhibitors: neomycin, 
chelerythrine. U73122, etc. 



C-Colorimetric/Fluorimetric Assay 



15 



20 



Enzyme 


Structure 


Size 


Mono/ 
Oligo 


Selection drugs/Conditions 


DT-Diaphorase (NAD(P)H- 
[quinone acceptor] 
oxidoreductase) 


1QRD 


26kD 


D 


NADPH-diaphorase stain, 
inhibited by dicumarol, 
Cibacron blue and phenidione 
Note: can also be used in a cell 
death assay 
(+nitrobenzimidazole, fo 
example). 


(NAD(P)H-[quinone 
acceptor] oxidoreductase)-2 


isoform of 
1QRD 


21kD " 


D 


NRH-diaphorase stain, 
inhibited by 
pentahydroxyflavone 


Thermophilic diaphorase 
(Bacillus 

stearothermophilus) 




30kD 


M 


NADH-diaphorase stain 


Glutathione-S-transferase 


1GNE 


26kD 
other 
isoform 
of28kD 


D 
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EXAMPLE 4 

Examples of Variants of PCA to detect multiple protein/protein- 
dna/protein RNA/protein-drua complexes 

20 To this point specific examples have only been made of 

applications of PCA to protein-pair interactions. However, it is possible 
to apply PCA to multiprotein, prptein-RNA, protein-DNA or protein-small 
molecule interactions. There are two general schemes for achieving such 
systems. Multi-subunit PCA: Two proteins need not interact for a PCA 

25 signal to be observed; if a partner protein or protein complex binds to two 
proteins simultaneously, it is possible to detect such a three protein 
complex. A multusubunit PCA is conceived with the example of herpes 
simplex virus thymidine kinase (TK), a homodimer of 40 kD . In this 
conception, the TK structure contains two well defined domains consisting 
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of an alpha/beta (residues 1-223) and an alpha-helical domain (224-374). 
As a test system, we use the Rop1 dimer, a four helix bundle homodimer. 
The two fragments of TK are extracted by PCR and subcloned into the 
transient transfection vector pMT3, the first in tandem to the Adenovisus 
5 major late promoter, tripartite leader 3' to the first ATG, and the second 
downstream of a ECMV internal initiation site. Restriction sites previously 
introduced between the first and the last ATG are subcloned into BamHI/ 
Kpnl and Pstl/EcoRI cloning sites downstream of the two ATGs. These 
are used to subclone PCR-generated fragments of the Rop1 subunits into 
10 two different vectors. Subsequently Ltk- cells are cotransfected by 
lipofection with the two plasmids and colonies of surviving cells are 
serially selected in medium containing increasing concentrations of HAT 
(hypoxanthine/ aminopterin/thymidine). Cells that express 
complementary fragments of TK fused to the four Rop1 will proliferate 
15 under this selective pressure, or otherwise die. Specific examples of use 
of this concept would be in determining constituents of multiprotein 
complexes that are formed transiently or constitutively in cells. 

The utility of PCA is not limited to detecting protein- 
protein interactions, but can be adapted to detecting interactions of 
20 proteins with DNA, RNA, or small molecules. In this conception, two 
proteins are fused to PCA complementary fragments, but the two proteins 
do not interact with each other. The interaction must be triggered by a 
third entity, which can be any molecule that will simultaneously bind to the 
two proteins or induce an interaction between the two proteins by causing 
25 a conformational change in one or both of the partners. Two examples 
have been demonstrated in our lab using the mDHFR PCA in E. coli. In 
the first case a natural product, the immunosuppressant drug rapamycin, 
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is used to induce an interaction between its receptor FKBP12 and a 
partner protein mTOR ( mammalian Target of Rapamycin). We detect 
this by cotransformation of DHFR fragments fused to FKBP or mTOR into 
E. coli grown in the presence or absence of trimethoprim (as described 
5 above) and rapamycin (0- 10 nM). We have demonstrated that support 
of growth as detected by colony formation is completely dependent on the 
addition of rapamycin, suggesting that the mDHFR PCA is detecting a 
rapamycin-induced assembly of a FKBP12-mTOR and subsequent 
reconstitution of DHFR activity. This is one example of a use of the PCA 

10 strategy to test for small molecules that can induce interactions between 
proteins. General applications could be made to therapeutic 
development, in the form screening small molecule combinatorial 
compound libraries for molecules that induce interactions between 
proteins, that may inhibit the activities of either or both of the proteins, or 

1 5 activate specific cellular processes that are initiated by other events, such 
as growth factor-mediated receptor dimerization. The discovery of such 
small molecules could lead to the development of orally available drugs 
for the treatment of a broad spectrum of human diseases. 

Another example of an induced interaction we have 

20 studied with the DHFR PCA is the interaction of the oncogene GTPase 
p21 ras and its direct downstream target, the serine/threonine kinase raf. 
This interaction only occurs when the GTPase is in the GTP-bound form, 
whereas turnover of GTP to GDP leads to release of the complex. As 
with the FKBP-mTOR complex, we have demonstrated this induced 

25 interaction in E. coli. PCA could be used in a general way to study such 
induced interactions, and to screen for compounds that release or 
prevent these interactions in pathological states. The ras-raf interaction 
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itself could be a target of therapeutic intervention. Oncogenic forms of 
ras consist of mutants that are incapable of turning over GTP and 
therefore remain continuously associated with activated ras. This leads 
to a constitutive uncontrolled growth signal that results, in part, in 
5 oncogenesis. The identification of compounds that inhibit this process, 
by PCA, would be of value in broad treatment of cancers. Other 
examples of multimolecular applications of PCA could include 
identification of novel DNA or RNA binding proteins. In its simplest 
conception one uses a known DNA or RNA binding motifs, for instance 

10 a retinoic acid receptor zinc finger, or a simple RNA binding protein such 
as IF-1 , respectively. One half of the PCA consists of the DNA or RNA 
protein binding domain fused to one of the PCA fragments (control 
fragment). The complementary fragment is fused to a cDNA library. A 
third entity, the gene coding for a sequence containing an element known 

15 to bind to the control protein, and then a second putative or known 
regulatory element is coded for after this sequence. A test system 
consists of tat/tar elements that control elongation in 
transcription/translation of HIV genes. An example application would be 
identification of tat binding elements that have been proposed to exist in 

20 eukaryotic genomes and may regulate genes in the same or similar way 
to that of HIV genes. (SenGupta D. J. et al. (1996) Proc. Natl. Acad. USA 
6, 8496-8501). 

EXAMPLE 5 

25 Examples of PCA applications to drug screening: Screening 
combinatorial libraries of compounds for those that inhibit or induce 
protein-protetn/protein-rna/protein-PNA complexes 



BNSDOCID:<WO 9834120A1> 



WO 98/34120 PCT/CA98/00068 

63 



A) Drug screening 

Screening combinatorial libraries of compounds for 
those that inhibit or induce protein-protein/protein-rna/protein-DNA 
complexes. The PCA strategy can be directly applied to identifying 
5 potentially therapeutic molecules contained in combinatorial libraries of 
organic molecules. It is possible to perform high throughput screening of 
such libraries to screen for compounds that will inhibit or induce protein- 
protein interactions or protein-DNA/RNA interactions (as discussed 
above). In addition it is also possible to screen for compounds that inhibit 

10 enzymes whose substrates are other proteins DNA, RNA or 
carbohydrates, as discussed below. In this application, proteins that 
interact/protein substrate pairs, or control DNA/RNA binding protein- 
enzyme pairs are fused to PCA complementary fragments and plasmids 
harboring these pairs are transformed/transfected into a cell, along with 

15 any third DNA or RNA element as the case requires. 
Transformed/transacted cells are grown liquid culture in multiwell plates 
where each well is inoculated with a single compound from an array of 
combinatorially synthesized compounds. A readout of a response 
depends on the effect of a compound. If the compound inhibits a protein 

20 interaction, there is a negative response (no PCA signal is the positive 
response). If the compound induces a protein interaction, the response 
is a positive PCA signal. Controls for non-specific effects of compounds 
include: 1) demonstration that the compound does not effect the PCA 
enzyme itself (test against cells transfected with the wild-type intact 

25 enzyme used as the PCA probe) and in the case of a cell survival assay, 
that the compound is not toxic to the cells that have not been 
transformed/transfected. As well as providing a high throughput assay for 
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biological activity of compounds, PCA also offers the advantage over in 
vitro assays that it is a test for cell membrane permeability of active 
compounds. Specific demonstrated examples of PCA for drug screening 
in our laboratory include the application of DHFR PCA in E. coli to 
5 detecting compounds that inhibit therapeutically relevant targets. These 
include Bax/Bcl2 fkbp12/tor ras/raf, carboxyl terminal dimerization 
domain of HIV-1 capsid protein, IkB kinase IKK-1 and IKK-2 dimerization 
domains (leucine zippers and helix-loop-helix domains). In each case, 
the two proteins are subcloned 5* upstream of either F[1,2] or F[3] as 
10 described above. Plasmids harboring the complementary fragments are 
cotransformed into BL21 cells. Colonies from minimal medium plates 
containing IPTG and trimethoprim are picked, and grown in liquid medium 
under the same selective conditions and frozen stocks made. For a 
single screening cycle, a priming overnight culture is grown from frozen 
15 stocks in LB medium. A selective minimal medium containing 
trimethoprim, ampicillin, IPTG is aliquated at 25 ml into each well of a 384 
well plate. Each well is then inoculated with 1 ul of an individual sample 
from a compound array (ArQule Inc.) to give a final concentration of 10 
uM. Each well is then inoculated with 2 ml of overnight culture and 
20 plates are incubated in a specially adapted shaker bath at 37C. At 2 hour 
intervals, plates are read on an optical absorption spectroscopic plate 
reader coupled to a PC and spreadsheet software at 600 nm (scattering) 
for a period of 8 hours. Rates of growth are calculated from individual 
time readings for each well and compared to a standard curve. A "hit" is 
25 defined as a case where an individual compound reduces the rate of 
growth to less than the 95 % confidence interval based on the standard 
deviation for growth rates observed in all of the wells within the test plate. 
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"Near hits" are defined as those cases where growth rates are within the 
95 % confidence interval. For each of the hits or near hits, the following 
controls are then performed: The same experiment is performed with 
BL21 cells that are transformed with empty vector (and no trimethoprim), 
5 with vector harboring the full length mDHFR gene, or with cotransfected 
cells where protein expression is not induced by IPTG. If in all of these 
cases the compound has no effect, it can be concluded that it is 
specifically disrupting the protein-protein interaction being tested. Such 
validated hits or near hits are then retested to establish a dose-response 

10 curve for the individual compound, with concentrations varying from 1 pM 
up to 1 mM by orders of magnitude of 10. The PCA strategy for 
compound screening can also be applied in the multiprotein protein- 
RNA/DNA cases as described above, and can easily be adapted to the 
DHFR or any other PCA in E coli or in yeast versions of the same PCAs. 

15 Such screening can also be applied to enzymes whose targets are other 
proteins or nucleic acids for known enzyme/substrate pairs or to novel 
enzyme substrate pairs identified as described below. 
Proteins involved in viral integration processes are examples of targets 
that could be tested for inhibitors using the PCA strategies. Examples for 

20 the HIV virus include: 

i) Inhibition of integrase or the transport of the pre-integration complex: 
protein Ma or vpr. 

25 ii) Inhibition of the cell cycle in G2 by vpr (interaction by cyclin B) causing 
induction of apoptosis. 
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iii) Inhibition of the interaction of gp160 (precursor of the membrane 
proteins) with furine. 

Accessory proteins of HIV as a therapeutic target: 
5 i) Vpr: nuclear localizing sequence (target): interaction site of vpr with 
phosphatasesA. 

ii) vif: interaction with vimentin (cytoskeleton associated protein) . 

i0 ii) Vpu: Degradation of CD4 in the RE mediated by the cytoplasmic tail of 
Vpu. 

iii) nef: Myristoylation signal of Nef. 

15 Other general targets for drug screening could include 

proteins linked neurodegenerative diseases, such as to alpha-synuclein. 
This protein has been linked to early onset of Parkinson disease and it is 
present also implicated in in Alzheimer disease. There is also b-amyloid 
proteins, linked to Alzheimers disease. 

20 An example of protein-carbohydrate interactions that 

would be a target for drug screening includes the selectins that are 
generally implicated in inflammation. These cell surface glycoproteins are 
directly involved in diapedesis. 

A number of tumor supressor genes whos actions are 

25 mediated by protein-protein interactions could be screened for potential 
anti-cancer compounds. These include PTEN, a tumor supressor directly 
involved in the formation of harmatomas. It is also involved in inherited 
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breast and thyroid cancer. Other interesting tumor supressor genes 
include p53, Rb and BARC1. 

EXAMPLE 6 

5 Examples of applications the PCA strategy to detect 
enzyme/substrate interactions 

The examples described above are used for identifying 
novel molecular interactions involving molecules that merely bind to each 
other. However detecting the substrates of enzymes is also fully 
1 0 compatible with the PCA strategy as shown below: 

i) Enzymes that form tight complexes or with protein substrates or induce 
efficient PCA fragment assembly or 

15 ii) Mutant enzymes that bind tightly to substrate but do not undergo 
product release because of mutations residues involved in nucleophilic 
attack and/or product release (substrate trapping). 

Enzymes may form tight complexes with their substrates 
(Kd -1-10 mM). In these cases PCA may be efficient enough to detect 

20 such interactions. However, even if this is not true, PCA may work to 
detect weaker interactions. Generally, if the rate of catalysis and product 
release is slower than the rate of folding- reassembly of the PCA 
complementary fragments, effectively irreversible folding and 
reconstitution of the PCA reporter activity will have occurred. Therefore, 

25 even if the enzyme and substrate are no longer interacting, the PCA 
signal is detected. Therefore, the detection of novel enzyme substrates 
using PCA may be possible, independent of effective substrate Kd or rate 
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of product release. In cases where product release is much faster than 
PCA fragment assembly/folding and alternative approach is provided by 
generating "substrate trapping" mutants of the test enzyme. An example 
of this approach applied to the protein tyrosine phosphatase PTP1B, 

5 where substrate trapping mutants have been generated by mutating the 
nucleophilic aspartate 181 to alanine rendering the enzyme catalyticly 
dead, but capable of forming tight complexes with a known substrate, the 
EGF receptor and other unknown proteins (Flint, A. J. et al. (1996) Proc. 
Natl. Acad. USA 941680-1685). An application of using PCA to screen 

10 for interacting partners of PTP1B is given as follows. We use the 
aminoglycoside kinase (AK)-based PCA in transiently transfected COS 
or 293 cells. The substrate trap mutant catalytic domain of PTP1B is 
fused to AMerminal complementary fragment of AK, while a C-terminal 
fusion of the other AK fragment is made to a cDNA library. Cells are co- 

15 transfected with complementary AK pairs and grown in selective 
concentrations of G418. After 72 hours, colonies of surviving cells are 
picked and in situ PCR is performed using primers designed to anneal to 
3" and 5' flanking regions of the cDNA coding region. PCR amplified 
products are then 5' sequenced to identify the gene. 

20 Enzyme inhibitors Screening combinatorial libraries of compounds for 
those that inhibit enzyme-PROTEIN substrate complexes either with: 

i) Enzymes that form tight complexes with protein substrates or 

ii) Mutant enzymes that bind tightly to substrate but do not undergo 
25 product release because of the mutation. 
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EXAMPLE 7 

Applications of the PCA str ategy to protein engineering/evolution 

The PCA strategy can be used to generate peptides or 
proteins with novel binding properties that may have therapeutic value, 
as is done today with phage display technology. It is also possible to 
develop enzymes with novel substrate or physical properties for industrial 
enzyme development. Two detailed examples of the application of the 
PCA strategy to these ends are given below, with additional applications 
listed below. 

1) Selection of high-affinity, heterodimerizing leucine zipper 
sequences (J. Pelletier, K. Arndt, A. Plueckthun and S. Michnick, 
manuscript in preparation). The mDHFR PCA, described above, was 
used in a scheme for the selection of efficiently heterodimerizing, 
designed leucine zippers. It has been proposed that the formation of salt 
bridges between positively and negatively charged residues at 
complementing C-eC- and 6g6 positions is important in stabilizing leucine 
zipper formation, though this view has been contested. In order to help 
define the importance of salt-bridge formation at the e and g positions, 
two leucine zipper libraries were built. Both are based on the GCN4 
leucine zipper sequence, but contain sequence information specific to 
either Jun or Fos zippers in order to create heterodimerizing pairs. As 
well, the e-1 to e-4 and g-1 to g-4 positions in each library were 
randomized to code for positively or negatively charged residues, or 
neutral polar residues. These libraries were amplified by PCR and 
subcloned into the Z-F[1,2] or Z-F[3] constructs (described above) from 
which the GCN4 zipper sequences had been removed. The bacterial 
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mDHFR PCA selection was performed on selective solid media, as 
described earlier. Colonies were picked and sequenced; sequence 
analysis reveals that the distribution of charged or neutral residues at e-g 
pairs is not random, but is biased toward pairing of opposite charges, or 
5 pairing of a charged with a neutral residue, rather than same-charge 
pairing (see figure 7). We reasoned that better zipper pairing should lead 
to an increase in efficiency of DHFR-fragment complementation, resulting 
in faster bacterial doubling times (see Table 1 in the mDHFR PCA 
description), and undertook a selection/enrichment of the novel zippers 
10 relative to GCN4, as follows. The designed zipper libraries, expressed as 
N-terminal fusions to the DHFR F[1 ,2] or F[3:I1 14A], were cotransformed, 
clones were picked, propagated and mixed in selective liquid culture, and 
the mix was added in a 1 :1 000 000 ratio to clone Z-F[1 ,2] + Z-F[3:I1 14A] 
(original GCN4 leucine zippers). The mixture was propagated in selective 
15 liquid culture over multiple passages. Restriction analysis shows that 
within 4 passages, the population of GCN4-expressing bacteria is 
diminishing relative to the novel zipper sequences (data not shown), 
indicating that some of the designed zipper-containing clones are 
propagated at a higher rate than those containing GCN4. Bacteria from 
20 later passages were plated on selective medium, and individual clones 
sequenced to reveal the identity of the most successful designed zipper 
pairs (data not shown). 

2) Application of PCA to enzyme function and design PCA 

25 Development Adenosine deaminase (ADA) meets all of the criteria for 
a PCA listed above. ADA is a small (-40 kD), and easily purified 
monomeric zinc metallo-enzyme and the structure of murine ADA has 
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been resolved. Several in vitro ADA activity assays have been 
developed, involving UV spectrophotometry and stopped-flow fluorimetry. 
E. coli ADA catalyzes the irreversible conversion of cytotoxic adenine 
nucleosides to non-toxic inosines . 
5 Eukaryotic or prokaryotic cells propagated in the 

presence of cytotoxic concentrations of adenosine or adenosine analogs 
require ADA to detoxify these compounds. This is the basis of a 
dominant-selection strategy used to select for cells expressing a specific 
gene in mammalian cells. The ADA gene has also been expressed in 

10 SF3834 E. coli cells which lack a gene coding for endogenous ADA. 
When the gene coding for ADA is introduced into ADA- bacterial DNA, 
those cells that express ADA are able to survive high concentrations of 
added adenosine; those that do not r die . This forms the basis of an in 
vivo ADA activity assay. 

15 We chose ADA, principally because it can be used as 

a dominant selective marker in mammalian and bacterial cells where the 
gene has been knocked out. The reason we choose dominant selective 
genes is because in screening for novel protein-protein interactions, 
particularly testing for interactions of a known protein against a library of 

20 millions of independent clones, selection serves to filter for cells that may 
show a positive response for reasons having nothing to do with a specific 
protein-protein interaction. We will use three test systems of interacting 
proteins including leucine zipper-forming sequences, the proteins raf and 
p21 and the induced oligomerization system, FK506 binding protein 

25 (FKBP) and mTOR that interact through the macrocyclic immuno- 
suppressant compound rapamycin. For all of these systems, we will 
construct E. coli and mammalian transient transfection plasmids and 
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subclone the test proteins as fusions to ADA complementary fragments. 
The primary assay will be survival of SF3834 E. coli cells that have been 
transformed with the complementary ADA fragments fused to the test 
oligomerization proteins in the presense of toxic concentrations of 

5 adensosine. We will then purify fusion proteins from colonies of and 
perform in vitro assays of ADA activity as described below. The utility of 
the ADA PCA as a method to identify novel proteins that interact with a 
test bait will be performed in mammalian COS-7 and HEK-293T cells 
transiently transfected with FKBP fused to one of the ADA fragments and 

10 the other fragment fused to a cDNA library from normal human spleen 
containing 10 6 independent clones. As with the E. coli assay, cells that 
survive in a medium containing toxic concentrations of ADA is collected 
and isolated plasmids will be testd to identify the gene for the interacting 
protein by PCR amplification and chain propagation-termination 

15 techniques. 

Structural motifs required for protein function: Determination of the 
structural elements required for the enzymatic function of ADA are 
investigated through alteration of the structures of the enzyme fragments. 

20 At first, ADA is cut into two separate domains - one responsible for 
substrate binding (residues 1-210) and one responsible for catalysis 
(residues 211-352). These separate pieces will be attached to known 
assembly domains, such as leucine zippers (see example 1 above). 
Reassembly will restore activity which will be assessed through detailed 

25 in vitro kinetic analysis of the binding and catalytic properties of the re- 
assembled enzyme, using UV spectrophotometry and stopped-flow 
fluorimetry to observe the enzymatic reactions. This system will provide 
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another handle on the manipulation of enzyme activity that will afford a 
powerful tool for enzymatic mechanism study. For example, the 
difference in the kinetic behaviour of the reassembled enzyme on mixing 
with the substrate, compared to enzyme reassembled in presence of 
5 substrate (where substrate may already be bound by binding domain) will 
allow sophisticated level of study of importance of binding energy to 
catalysis. Subsequent point mutations to the functional or assembly 
domains of the proteins will then allow a very subtle perturbation and 
detailed quantification of the relationship of binding energy to catalysis. 
10 This precise control over the structure and assembly of separate 
functional domains of the enzyme will permit very sophisticated enzymatic 
structure function studies, the definition of structural motifs and an 
understanding of their role in catalysis. 

15 Novel protein catalyst design: The detailed knowledge of the enzyme 
mechanism gained through determination of the structural requirements 
for catalysis will then be exploited through the combination of these 
functional Obuilding blocks6 with the functional motifs responsible for 
substrate binding and catalysis in other enzymes, allowing the generation 

20 of novel protein catalysts. For example, the catalytic motif from ADA is 
modified to a cytidine-binding motif, creating a novel enzyme with 
potentially useful catalytic properties. The activity of these novel 
enzymes can easily be assessed through in vivo assays similar to that 
of the PCA system, or through in vitro activity assays. Furthermore, the 

25 detailed mechanistic investigation of the resulting enzymes possible with 
this, system will permit the rational design of each subsequent generation 
of catalysts. 
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EXAMPLE 8 

Examples of applications the PCA strategy to detect molecular 
interactions in whol e organisms 

It is a logical extension of the descriptions of PCA 
5 applications above to the utility of these techniques in whole model 
organisms such as drosophila, nematodes, zebra fish and puffer fish, as 
examples. The sole differences with other listed examples is that vectors 
used would need to be different (for example retroviral vectors) and that 
any substrates needed by the PCA would need to be bioavailable, or 
10 detection would need to be performed in situ. 

EXAMPLE 9 

Examples of applications of the PC A strategy to Gene Therapy 

Another important embodiment of the invention is to 
1 5 provide a means and method for gene therapy of mammalian disease. Of 
particular interest is the use of PCA therapeutic for treatment of cancer. 
In one embodiment of said PCA gene therapy, a PCA is developed 
employing fragments (modular protein units) derived from a protein toxin 
for example: Pseudomonas exotoxin, Diptheria toxin and the plant toxin 
20 gelonin, or other like molecules. For therapy of breast cancer for example, 
first a mammalian, retroviral, adenoviral, or eukaryotic artificial 
chromosomal (EAC's) genetic construct is prepared that introduces one 
fragment of the selected toxin under the control of the promoter for 
expression of the eroB2 oncogene. Its is well known that the enbB2 
25 oncogene is overexpressed in breast cancer and adenocarcinoma cells 
( D. J. Slamon et. al., Science, 1989, 244, 707 ). The HER2/neu (c-erfcB- 
2) proto-oncogene encodes a sub-class 1 185-kDa transmembrane 
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protein tyrosine kinase growth factor receptor, p185 HER2 . Also, the human 
erbB2 oncogene is located on chromosome 17, region q21 and 
comprises 4,480 base pairs and p185 HER2 serves as a receptor for a 30- 
kDa glycoprotein growth factor secreted by human breast cancer cell lines 
5 ( R. Lupu et. Al., Science, 1990, 249, 1152 ). 

The transgene is introduced 'in vivo' or 'ex-vivo' into 
target cells employing methods known by those skilled in the art e.g. 
homologous recombination to insert transgene into locus of interest via 
retroviral, adenoviral or EAC's. A second genetic construct comprising a 

0 fusion gene containing a target DNA that encodes an interacting protein 
that interacts with er6B2 oncogene discovered by the PCA process 
described in this invention and the "second" fragment of the toxin 
molecule. This construct is delivered to the patient by methods known in 
the art for example as shown in U.S. Patent Nos. 5,399,346 and 

5 5,585,237 whose entire contents are incorporated by reference herein. 
Transgene expression of the eroB2 oncogene-toxin fragment described 
will now be under the control of the constitutive oncogene promoter. 
Proliferating tumor cells will thus produce one piece of the toxin attached 
as a fusion to the eroB2 oncogene. In the presence of the second 

) genetic construct expressing the PCA discovered interacting erbB2 
oncogene "interacting protein - toxin fragment" construct then: erbB2 
oncogene-toxin fragmentA: interacting protein-toxin fragment B will be 
created and induce death of target tumor cells through creation of an 
active toxin through Protein Fragment Complementation and thus provide 

1 an efficacious and efficient therapy of said disease. 

This can be extended to other diseases and other toxins 
employing techniques described and embodied in this invention. 
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FX AMPLE 10 

FrnTrr — ^ ap plication* *h» PHA stratrnv to detect molecular 

Any of the PCA strategies described above could be 
addapted to in vitro detection. Unlike the in vivo PCAs however, 
detection would be performed with purified PCA fragment-fusion proteins. 
Such uses of PCA have the potential for use in diagnostic kits. For 
example the test DHFR assay described above where the interacting 
domains are FKBP12 and TOR could be used as a diagnostic test for 
rapamycin concentrations for use in monitoring dossage in patients 

treated with this drug. 

As shown above, the instant invention provides: 

1) Allow for the detection of protein-protein interactions in vivo or in vitro. 

2) Allow for the detection of protein-protein interactions in appropriate 
contexts, such as within a specific organism, cell type, cellular 
compartment, or organelle. 

3) Allow for the detection of induced versus constitutive protein-protein 
interactions (such as by a cell growth or inhibitory factor). 

4) To be able to distinguish specific-versus non-specific protein-protein 
20 interactions by controlling the sensitivity of the assay. 

5) Allow for the detection of the kinetics of protein assembly in cells. 

6) Allow for screening of cDNA libraries for protein-protein interactions. 

Further aspects of the invention can be demonstrated 
by identifying novel interactions with the enzyme p70S6k, to determine its' 
regulation and how separate signaling cascades converge on this 
enzyme. 
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The PCA method is particularly useful for detection of 
the kinetics of protein assembly in cells. The kinetics of protein assembly 
can be determined using fluorescent protein systems. 

In a further embodiment of the invention, PCA can be 
5 used for drug screening. The techniques of PCA are used to screen for 
drugs that block specific biochemical pathways in cells allowing for a 
carefully targeted and controlled method for identifying products that have 
useful pharmacological properties. 

Although the present invention has been 
10 described herein above by way of preferred embodiments thereof, it can 
be modified, without departing from the spirit and nature of the subject 
invention as defined in the appended claims. 
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WHAT IS CLAIMED IS; 



1 . Molecular fragment complementation assays for the 
detection of molecular interactions comprising a reassembly of separate 
5 fragments of a molecule, wherein reassembly of said fragments is 
operated by the interaction of molecular domains fused to each fragment 
of said molecules, and wherein reassembly of the fragments is 
independent of other molecular processes. 

10 2 A method for detecting biomolecular interactions said 

method comprising: 

(a) selecting an appropriate reporter molecule; 

(b) effecting fragmentation of said reporter molecule 
such that said fragmentation results in reversible loss of reporter function; 

(c) fusing or attaching fragments of said reporter 
molecule separately to other molecules; followed by 

(d) reassociation of said reporter fragments through 
interactions of the molecules that are fused to said fragments. 



15 



20 



25 



3. The method of claim 2 wherein said reporter 
molecule is a multimeric protein. 

4. The method of claim 2 wherein said reporter 
molecule is an multimeric enzyme. 

5. The method of claim 2 wherein said reporter 
molecule is a multimeric receptor. 
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6. The method of claim 2 wherein said reporter 
molecule is a multimeric binding protein. 

7. The method of claim 2 wherein said reporter 
5 molecule is a catalytic molecule. 

8. The method of claim 2 wherein said reporter 
molecule is an energy transfer molecule. 

1Q g. The method of claim 2 wherein said reporter 

molecule is a fluorescent or luminescent or phosphorescent protein. 

10. The method of claim 2 wherein said detected 
molecule is a nucleic acid or a ribozyme. 

15 

11. The method of claim 2 wherein said detected 
molecule is a lipid or an oligosaccharide. 

12. The method of claim 2 wherein said detected 
20 molecule is a ligand. 

13. The method of claim 2 wherein said detected 
molecule is a nucleic acid. 

25 !4. The method of claim 2 wherein said detected 

molecule is a peptide. 
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15. The method of claim 2 wherein said detected 
molecule is a carbohydrate. 

16. The method of claim 2 wherein said fragmentation 
5 is effected by a method selected from the group consisting of genetic 

manipulation, synthetic chemistry or de novo synthesis, photochemical or 
enzymatic cleavage, and proteolytic or hydrolytic chemistry. 

17. The method of claim 2 wherein said reassociation 
10 of the reporter molecule fragments is effected by molecules fused or 

attached to said fragments. 

18. A method of testing biomolecular interactions 

comprising: 

15 a) generating a first fusion product comprising 

i) a first fragment of a first molecule and 

ii) a second molecule which is different or the 
same as said first molecule; 

b) generating a second fusion product comprising 
20 i) a second fragment of said first molecule; and 

ii) a third molecule which is different from or the 
same as said first molecule or second molecule; 

c) allowing the first and second fusion products to 
contact each other; and 

25 d) testing for activity regained by association of the 

recombined fragments of the first molecule, wherein said reassociation is 
mediated by interaction of the second and third molecules. 



BNSDOCID: <WO 9834120A1> 




WO 98/34120 



PCT/CA98/00068 



94 



1 9. The method of claim 1 8 wherein at least one of said 
second or said third molecules is a protein. 

20. The method of claim 1 8 wherein at least one of said 
5 second or said third molecules is an enzyme. 

2 1 . The method of claim 1 8 wherein at least one of said 
second or said third molecules is a nucleic acid. 

10 22. A method comprising an assay where fragments of 

a first molecule are fused to a second molecule and fragment association 
is detected by reconstitution of the first molecule's activity. 

23. A composition comprising a product selected from 

1 5 the group consisting of: 

(a) a first fusion product comprising: 

1) a first fragment of a first molecule whose 
fragments can exhibit a detectable activity when associated and 

2) a second molecule that can bind (a)(1); 
20 (b) a second fusion product comprising 



c) 



1) a second fragment of said first molecule and 

2) a third molecule that can bind (b)(1); and 
both (a) and (b). 



25 



24. A composition comprising complementary 
fragments of a first molecule, each fused to separate molecules. 
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25. The composition of Claim 24 wherein the first 
molecule is selected from the group consisting of: multimeric protein, 
multimeric receptor, multimeric binding protein, catalytic molecule, energy 
transfer molecule, a fluorescent or luminescent or phosphorescent 

5 protein. 

26. The composition of Claim 24 wherein the first 
molecule is a multimeric enzyme. 

10 27 '■ Th © composition of claim 24 wherein the second 
and third molecules can bind to each other. 

28. A composition comprising a nucleic acid molecule 
coding for a fusion product, which molecule comprises sequences coding 
15 for a product selected from the group consisting of: 

(a) a first fusion product comprising: 

1 ) fragments of a first molecule whose fragments 
can exhibit a detectable activity when associated and 

2) a second molecule fused to the fragment of the 

20 first molecule; 

(b) a second fusion product comprising 

1) a second fragment of said first molecule and 

2) a second or third molecule; and 

(c) both (a) and (b). 

25 

29. A host cell comprising a composition according to 
either Claim 24 or Claim 28. 
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30. An assay which comprises using either the 
composition according to Claim 24 or Claim 28 or using the host cell 
according to Claim 29. 

31 . A method of testing for biomolecular interactions 
associated with: (a) complementary fragments of a first molecule whose 
fragments can exhibit a detectable activity when associated or (b) binding 
of two protein-protein interacting domains from a second or third 
molecule, said method comprising: 

1) creating a fusion of 

(a) a first fragment of a first molecule whose 
fragments can exhibit a detectable activity when associated and 

(b) a first protein-protein interacting domain; 

2) creating a fusion of 

(a) a second fragment of said first molecule and 

(b) a second protein-protein interacting domain 
that can bind said first protein-protein interacting domain; 

3) allowing the fusions of (1 ) and (2) to contact each 

other; and 

4) testing for said activity. 

32. The method according to any one of Claims 1 , 2, 
18, 22 or 31 wherein the activity of the fragments of said first molecule is 
controlled by changing parts of the fragments to either increase or 
decrease their ability to associate. 
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33. A composition comprising a product selected from 
the group consisting of: 

(a) a first fusion product comprising: 

1) a first fragment of a molecule whose 
fragments can exhibit a detectable activity when associated and 

2) a first protein-protein interacting domain; 

(b) a second fusion product comprising 

1) a second fragment of said first molecule and 

2) a second protein-protein interacting domain 
that can bind said first protein-protein interacting domain; and 

(c) both (a) and (b). 

34. A composition comprising a nucleic acid molecule 
coding for a fusion product, which molecule comprises sequences coding 
for either: 

(a) a first fusion product comprising: 

1) a first fragment of a molecule whose 
fragments can exhibit a detectable activity when associated and 

2) a first protein-protein interacting domain; or 

(b) a second fusion product comprising 

1 ) a second fragment of said molecule and 

2) a second protein-protein interacting domain 
that can bind said first protein-protein interacting domain; or 

(c) both (a) and (b). 

35. A host cell comprising a composition according to 
either Claim 33 or Claim 34. 
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36. An assay which comprises using either the 
composition according to Claim 33 or Claim 34 or using the host cell 
according to Claim 35. 

37. A method of detecting kinetics of protein assembly 
comprising performing PCA. 

38. A method of detecting kinetics of protein assembly 
comprising the method of any one of Claims 1, 2, 18, 22, or 31. 

39. A method of screening a cDNA library comprising 

performing PCA. 

40. A method of screening a cDNA library comprising 
the method of any one of Claims*! , 2, 18, 22, or 31. 

41 . A method according to any one of Claims 1 , 2, 18, 
22, 31, 39 or 40 wherein a chromogenic, fluorogenic, enzymatic, or other 
optically detectable signal is generated. 

42. A method according to any one of Claims 1 , 2, 1 8, 
22, 31, 39 or 40 wherein dihydrofolate reductase is used as part of a 
signal-generating system. 

43. A method of determining the minimum length of at 
least one of two or more interacting domains comprising performing any 
of the assays of Claims 1, 2, 18, 22, 31, 39 or 40. 
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44. A method of determining whether a molecular 
complex comprises two or more interacting domains, said method 
comprising performing any of the assays of Claims 1, 2, 18, 22, 31, 39 or 
40. 

45. A method according to any one of Claims 1 , 2, 1 8, 
22, 31, 39 or 40 wherein at least one other molecule is present which 
causes an interaction of said second and third molecules. 

46. A method according to any one of Claims 39 or 40 
where there is an interaction between parts of a first or a second 
molecule due to the presence of at least one other molecule which 
mediates the interaction. 



47. A method of testing the ability of a compound to 
inhibit molecular interactions in a PCA comprising performing a PCA in 
the presence of said compound and correlating any inhibition with said 
presence. 

48. A method for detecting protein-protein interactions 
in living organisms and or cells, which method comprises: 

(a) synthesizing probe protein fragments from an 
enzyme which enables dominant selection by dissecting the gene coding 
for the enzyme into at least two fragments; 

(b) constructing fusion proteins with one or more 
molecules that are to be tested for interactions; 
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(c) fusing the proteins obtained in (b) with one or 



more 



10 



15 



20 



of the probe fragments; 

(d) coexpressing the fusion proteins; and 

(e) detecting the reconstitution of enzyme activity. 

49. The method according to claim 48 wherein the 
enzyme is murine dihydrofolate reductase. 

50. The method according to claim 48 wherein the 
fusion proteins have peptides consisting of N- and C-terminal fragments 
of said murine dihydrofolate reductase and are fused to GCN4 leucine 
zipper sequences. 

51. The method according to claim 48 wherein 
coexpression of the complementary fusion proteins is catalyzed by the 
binding of the test proteins to each other. 

52. A method for screening or high-throughput 
screening of combinatorial libraries for compounds that trigger or inhibit 
protein-protein interactions, characterized by utilizing the method of claim 
48 to identify new drug targets. 

53. The method according to claim 52, wherein the 
targets are biological active proteins. 



25 
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54. The method of claim 53, wherein said biological 
active proteins are selected from the group consisting of receptors, 
inhibitors, enzymes or ion channels. 

5 55. A method for detecting biomolecular interactions 

said method comprising: 

(a) selecting an appropriate reporter molecule; 

(b) effecting fragmentation of said reporter molecule; 

(c) fusing or attaching fragments of said reporter 
10 molecule separately to other molecules; followed by 

(d) reassociation of said reporter fragments through 
interactions of the molecules that are fused to said fragments. 



15 



20 



56. The composition of Claim 55 wherein the first 
molecule is selected from the group consisting of: multimeric protein,- 
multimeric receptor, multimeric binding protein, catalytic molecule, energy 
transfer molecule, a fluorescent or luminescent or phosphorescent 
protein. 

57. The composition of Claim 55 wherein the first 
molecule is multimeric enzyme. 



58. In the method of affecting gene therapy, the step 
which comprises affecting the method of claim 2. 
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